Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/9101
Title: Optimization of association rules in Data Mining using Parallel Approach
Researcher: Shah, Ketan
Guide(s): Mahajan, Sunita
Keywords: Data Mining
Technology Management
Upload Date: 23-May-2013
University: Narsee Monjee Institute of Management Studies
Completed Date: 23/07/2011
Abstract: Data mining is the process of automatic extraction of novel, useful, and understandable patterns in very large datasets. One of the most important problems in Data Mining [1, 6] is discovering association rules. An example of association rule is “30% of all customers who buy jackets and gloves also buy hiking boots”. The association rule problem is to find all such rules whose frequency is greater than some user-specified minimum. One of the key features of previous algorithms developed is that they require multiple passes over the datasets. Over time, as datasets continue to grow inexorably in size and complexity, association rule problem demands more and more computational power. High performance scalable and parallel computing thus becomes crucial for ensuring system scalability and interactivity. This thesis deals with both algorithmic and system aspects for achieving optimization of association rules in data mining using parallel approach. The algorithmic aspect of optimization focuses on the design of efficient, scalable, parallel algorithm for association rules. We assume that the datasets are very large and disk-resident. Computer clusters are commonly used to increase the computational power for solving data mining problems. Clusters can contain homogeneous machines i.e. machines having same CPU speed and memory or they could be collection of heterogeneous machines which would have different CPU speeds and memory. The system aspect deals with scalable implementation on homogeneous and heterogeneous collection of networked workstations. It is commonly observed that more the number of components in a system, the probability of failure increases.
Pagination: 135p.
URI: http://hdl.handle.net/10603/9101
Appears in Departments:Department of Technology Management

Files in This Item:
File Description SizeFormat 
01_ title page.pdfAttached File17.81 kBAdobe PDFView/Open
02_table of contents.pdf19.41 kBAdobe PDFView/Open
03_abstract.pdf15.83 kBAdobe PDFView/Open
04_ list of tables and figures.pdf51.24 kBAdobe PDFView/Open
05_chapter 1.pdf80.05 kBAdobe PDFView/Open
06_chapter 2.pdf177.71 kBAdobe PDFView/Open
07_chapter 3.pdf108.61 kBAdobe PDFView/Open
08_chapter 4.pdf153.21 kBAdobe PDFView/Open
09_chapter 5.pdf496.66 kBAdobe PDFView/Open
10_chapter 6.pdf465.06 kBAdobe PDFView/Open
11_chapter 7.pdf28.62 kBAdobe PDFView/Open
12_appendix.pdf115.95 kBAdobe PDFView/Open
13_reference.pdf35.17 kBAdobe PDFView/Open
14_ publications.pdf13.12 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: