Efficient Pattern Mining of Big Data using Graphs

Bhatia, Vandana

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/326271

Title:	Efficient Pattern Mining of Big Data using Graphs
Researcher:	Bhatia, Vandana
Guide(s):	Rani, Rinkle
Keywords:	Big data Graph Mining Pattern Mining
University:	Thapar Institute of Engineering and Technology
Completed Date:	2018
Abstract:	Big data has great amount of hidden knowledge and many insights which have raised remarkable challenges in knowledge discovery and data mining. For certain types of data, the relationship among the entities is of much more importance than the information itself. Big data has many such connections which can be mined efficiently using graphs. However, it is very challenging to obtain ample profits from this complex data. To overcome these challenges, graph mining approaches such as clustering and subgraph mining are used. In recent times, these approaches have become an indispensable tool for analyzing graphs in various domains. This thesis presents research work undertaken in the field of pattern mining approaches for large graphs. The main objective of this research is to investigate the benefits of using scalable approaches for mining large graphs. Two fuzzy clustering algorithms namely PGFCand#8223; and PFCAand#8223; are proposed for large graphs using different concepts of graph analysis. Furthermore, a scalable deep learning based fuzzy clustering model named DFuzzyand#8223; is proposed that leverages the idea from stacked autoencoder pipelines to identify overlapping and non-overlapping clusters in large graphs efficiently. Our proposed clustering approaches are proved to be effective for small and large graph dataset, and generate high quality clusters. For mining frequent subgraphs, a scalable frequent subgraph mining algorithm named PaGroand#8223; is proposed for a single large graph using pattern-growth based approach. In PaGro, a two-step hybrid approach is developed for optimization of subgraph isomorphism and subgraph pruning task at both local and global levels to avoid the excess communication overhead. Additionally, an approximate frequent subgraph mining algorithm named Ap- FSMand#8223; is proposed which exploits PaGro using sampling for faster processing.
Pagination:	158p.
URI:	http://hdl.handle.net/10603/326271
Appears in Departments:	Department of Computer Science and Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	92.98 kB	Adobe PDF	View/Open
02_table of contents.pdf		96.6 kB	Adobe PDF	View/Open
03_list of figures.pdf		87.58 kB	Adobe PDF	View/Open
04_list of tables.pdf		81.93 kB	Adobe PDF	View/Open
05_certificate.pdf		166.6 kB	Adobe PDF	View/Open
06_acknowledgement.pdf		22.41 kB	Adobe PDF	View/Open
07_abstract.pdf		83.87 kB	Adobe PDF	View/Open
08_chapter 1.pdf		615.94 kB	Adobe PDF	View/Open
09_chapter 2.pdf		875.06 kB	Adobe PDF	View/Open
10_chapter 3.pdf		207.51 kB	Adobe PDF	View/Open
11_chapter 4.pdf		720.98 kB	Adobe PDF	View/Open
12_chapter 5.pdf		928.41 kB	Adobe PDF	View/Open
13_chapter 6.pdf		1.58 MB	Adobe PDF	View/Open
14_chapter 7.pdf		205.82 kB	Adobe PDF	View/Open
15_bibliography.pdf		351.26 kB	Adobe PDF	View/Open
16_list of publications.pdf		170.17 kB	Adobe PDF	View/Open
17_conferences.pdf		166.83 kB	Adobe PDF	View/Open
80_recommendation.pdf		297.74 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET