Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/326271
Title: Efficient Pattern Mining of Big Data using Graphs
Researcher: Bhatia, Vandana
Guide(s): Rani, Rinkle
Keywords: Big data
Graph Mining
Pattern Mining
University: Thapar Institute of Engineering and Technology
Completed Date: 2018
Abstract: Big data has great amount of hidden knowledge and many insights which have raised remarkable challenges in knowledge discovery and data mining. For certain types of data, the relationship among the entities is of much more importance than the information itself. Big data has many such connections which can be mined efficiently using graphs. However, it is very challenging to obtain ample profits from this complex data. To overcome these challenges, graph mining approaches such as clustering and subgraph mining are used. In recent times, these approaches have become an indispensable tool for analyzing graphs in various domains. This thesis presents research work undertaken in the field of pattern mining approaches for large graphs. The main objective of this research is to investigate the benefits of using scalable approaches for mining large graphs. Two fuzzy clustering algorithms namely PGFCand#8223; and PFCAand#8223; are proposed for large graphs using different concepts of graph analysis. Furthermore, a scalable deep learning based fuzzy clustering model named DFuzzyand#8223; is proposed that leverages the idea from stacked autoencoder pipelines to identify overlapping and non-overlapping clusters in large graphs efficiently. Our proposed clustering approaches are proved to be effective for small and large graph dataset, and generate high quality clusters. For mining frequent subgraphs, a scalable frequent subgraph mining algorithm named PaGroand#8223; is proposed for a single large graph using pattern-growth based approach. In PaGro, a two-step hybrid approach is developed for optimization of subgraph isomorphism and subgraph pruning task at both local and global levels to avoid the excess communication overhead. Additionally, an approximate frequent subgraph mining algorithm named Ap- FSMand#8223; is proposed which exploits PaGro using sampling for faster processing.
Pagination: 158p.
URI: http://hdl.handle.net/10603/326271
Appears in Departments:Department of Computer Science and Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File92.98 kBAdobe PDFView/Open
02_table of contents.pdf96.6 kBAdobe PDFView/Open
03_list of figures.pdf87.58 kBAdobe PDFView/Open
04_list of tables.pdf81.93 kBAdobe PDFView/Open
05_certificate.pdf166.6 kBAdobe PDFView/Open
06_acknowledgement.pdf22.41 kBAdobe PDFView/Open
07_abstract.pdf83.87 kBAdobe PDFView/Open
08_chapter 1.pdf615.94 kBAdobe PDFView/Open
09_chapter 2.pdf875.06 kBAdobe PDFView/Open
10_chapter 3.pdf207.51 kBAdobe PDFView/Open
11_chapter 4.pdf720.98 kBAdobe PDFView/Open
12_chapter 5.pdf928.41 kBAdobe PDFView/Open
13_chapter 6.pdf1.58 MBAdobe PDFView/Open
14_chapter 7.pdf205.82 kBAdobe PDFView/Open
15_bibliography.pdf351.26 kBAdobe PDFView/Open
16_list of publications.pdf170.17 kBAdobe PDFView/Open
17_conferences.pdf166.83 kBAdobe PDFView/Open
80_recommendation.pdf297.74 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: