Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/9847
Title: An investigation on automatic support thresholds for association rule mining
Researcher: Kanimozhiselvi C S
Guide(s): Tamilarasi A
Keywords: Knowledge Discovery
Data mining
Rule mining
Exact association rule
Upload Date: 11-Jul-2013
University: Anna University
Completed Date: 01/06/2011
Abstract: The process of Knowledge Discovery in Database is aimed at extracting useful information from large databases. Among the several steps of Knowledge Discovery in Databases, data mining is the core step. Data mining is the extraction of hidden, predictive information that are implicitly stored in large databases. An association rule will be interesting, if the support and confidence measures of the rules are greater than the user specified support and confidence thresholds. These thresholds play an important role in deciding the quantity and quality of the association rules. The research focuses on assigning automatic support thresholds for the mining of frequent and rare association rules. In view of this, four approaches have been presented in the thesis. The first approach emphasizes the need for levelwise calculation of non uniform support thresholds. In the first approach, a method has been proposed to compute the support thresholds by analyzing the support distribution of items and used for rule mining. Also another method has been proposed to compute support thresholds based on the previous support thresholds. These methods are helpful in extracting large frequent itemsets that are interesting in nature. The first approach is based on the assumption that the items at all levels have same nature and frequency in the database. The need for itemwise support thresholds has been investigated in the second approach. Also, an algorithm based on Confidence Lift Support measure algorithm has been developed to assign support threshold for each item. The algorithm extracts low support but high confidence association rules. The real world datasets consists of items that are of non uniform in nature. Some items appear frequently in dataset and some of them appear rarely. The third approach concentrates on the extraction of rare itemsets. Finally, an approach to deal with the mining of exact association rules has been presented.
Pagination: xvii, 129p.
URI: http://hdl.handle.net/10603/9847
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File49.53 kBAdobe PDFView/Open
02_certificates.pdf678.42 kBAdobe PDFView/Open
03_abstract.pdf12.39 kBAdobe PDFView/Open
04_acknowledgement.pdf13.39 kBAdobe PDFView/Open
05_contents.pdf44.06 kBAdobe PDFView/Open
06_chapter 1.pdf40.71 kBAdobe PDFView/Open
07_chapter 2.pdf47.98 kBAdobe PDFView/Open
08_chapter 3.pdf198.07 kBAdobe PDFView/Open
09_chapter 4.pdf105.06 kBAdobe PDFView/Open
10_chapter 5.pdf63.07 kBAdobe PDFView/Open
11_chapter 6.pdf57 kBAdobe PDFView/Open
12_chapter 7.pdf23.69 kBAdobe PDFView/Open
13_appendix.pdf64.94 kBAdobe PDFView/Open
14_references.pdf27.52 kBAdobe PDFView/Open
15_publications.pdf16.47 kBAdobe PDFView/Open
16_vitae.pdf11.41 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: