Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/224840
Title: Multilevel Association Rule Mining in Distributed Environment
Researcher: Prajapati Dinesh
Guide(s): Garg Sanjay
Keywords: Engineering and Technology, Computer Science, Computer Science Software Engineering
University: Nirma University
Completed Date: 08/08/2018
Abstract: Multilevel association rule mining discovers knowledge from conceptually hierarchical datasets and thus provides more significant information than single-level association rule mining. However, existing multilevel association rule mining algorithms are limited in processing speed when analyzing big data. To overcome this, a Hadoop-based distributed multilevel association rule mining approach is proposed, which splits the transactional dataset into partitions and then transfers each task to all participating nodes. It thus reduces inter-node message passing in the cluster. The proposed methodology is applied in two phases. In the first phase, the transactional dataset is generated from a big sales dataset using the Hadoop MapReduce framework. Then, the proposed distributed multilevel frequent pattern mining algorithms, MR-MLAB (MapReduce based Multilevel Apriori using Bottom-up Approach) and MR-MLAT (MapReduce based Multilevel Apriori using Top-down Approach), are used to generate level-crossing frequent itemsets for each level of the concept hierarchy. Performance of the system is compared based on the minimum support threshold at different levels of the concept hierarchy and also by varying the dataset size. Moreover, the time efficiency of the proposed algorithms is compared with that of the existing Traditional Multilevel Apriori (TMLA) algorithm. Because of ancestor relationships, the proposed distributed multilevel frequent pattern mining generates a huge amount of hierarchical redundancy. Thus, to improve the performance of the system, such hierarchical redundancy needs to be eliminated. In the second phase, the distributed multilevel frequent pattern mining algorithm is applied to regional transactional datasets to generate frequent k-itemsets for each region. Then, multilevel association rules are generated for each region. The resulting sets of regional multilevel rules are so large that analyzing them with traditional methods becomes complex. Hence, the MR-MCIRD (MapReduce based Multilevel Consistent and Inconsistent Rule Detection) algorithm is proposed to derive cons
URI: http://hdl.handle.net/10603/224840
Appears in Departments:Institute of Technology

Files in This Item:
File                          Description    Size       Format
02. certificate final.pdf     Attached File  143.35 kB  Adobe PDF
06_contents.pdf.pdf                          45.5 kB    Adobe PDF
07_list_of_tables.pdf.pdf                    24.71 kB   Adobe PDF
08_list_of_figures.pdf.pdf                   25.46 kB   Adobe PDF
10_chapter1. pdf.pdf                         294.17 kB  Adobe PDF
11_chapter2. pdf.pdf                         189.08 kB  Adobe PDF
12_chapter3. pdf.pdf                         81.43 kB   Adobe PDF
13_chapter4. pdf.pdf                         842.77 kB  Adobe PDF
14_chapter5. pdf.pdf                         988.81 kB  Adobe PDF
15_chapter6. pdf.pdf                         456.6 kB   Adobe PDF
16_conclusion. pdf.pdf                       29.17 kB   Adobe PDF
18_bibliography.pdf.pdf                      36.22 kB   Adobe PDF
1_title.pdf.pdf                              33.67 kB   Adobe PDF

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
