Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/9101
Title: | Optimization of association rules in Data Mining using Parallel Approach |
Researcher: | Shah, Ketan |
Guide(s): | Mahajan, Sunita |
Keywords: | Data Mining Technology Management |
Upload Date: | 23-May-2013 |
University: | Narsee Monjee Institute of Management Studies |
Completed Date: | 23/07/2011 |
Abstract: | Data mining is the process of automatic extraction of novel, useful, and understandable patterns in very large datasets. One of the most important problems in Data Mining [1, 6] is discovering association rules. An example of association rule is “30% of all customers who buy jackets and gloves also buy hiking boots”. The association rule problem is to find all such rules whose frequency is greater than some user-specified minimum. One of the key features of previous algorithms developed is that they require multiple passes over the datasets. Over time, as datasets continue to grow inexorably in size and complexity, association rule problem demands more and more computational power. High performance scalable and parallel computing thus becomes crucial for ensuring system scalability and interactivity. This thesis deals with both algorithmic and system aspects for achieving optimization of association rules in data mining using parallel approach. The algorithmic aspect of optimization focuses on the design of efficient, scalable, parallel algorithm for association rules. We assume that the datasets are very large and disk-resident. Computer clusters are commonly used to increase the computational power for solving data mining problems. Clusters can contain homogeneous machines i.e. machines having same CPU speed and memory or they could be collection of heterogeneous machines which would have different CPU speeds and memory. The system aspect deals with scalable implementation on homogeneous and heterogeneous collection of networked workstations. It is commonly observed that more the number of components in a system, the probability of failure increases. |
Pagination: | 135p. |
URI: | http://hdl.handle.net/10603/9101 |
Appears in Departments: | Department of Technology Management |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_ title page.pdf | Attached File | 17.81 kB | Adobe PDF | View/Open |
02_table of contents.pdf | 19.41 kB | Adobe PDF | View/Open | |
03_abstract.pdf | 15.83 kB | Adobe PDF | View/Open | |
04_ list of tables and figures.pdf | 51.24 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 80.05 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 177.71 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 108.61 kB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 153.21 kB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 496.66 kB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 465.06 kB | Adobe PDF | View/Open | |
11_chapter 7.pdf | 28.62 kB | Adobe PDF | View/Open | |
12_appendix.pdf | 115.95 kB | Adobe PDF | View/Open | |
13_reference.pdf | 35.17 kB | Adobe PDF | View/Open | |
14_ publications.pdf | 13.12 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: