Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/306579
Title: Knowledge Discovery in Databases Processing Using Improved Data Mining Techniques
Researcher: Manchanda, Sanjeev
Guide(s): Dave, Mayank and Singh, S,B.
Keywords: Data Mining
KDD
Machine Learning
University: Thapar Institute of Engineering and Technology
Completed Date: 2010
Abstract: This thesis focuses on problems related to supervised learning for data mining and system development for knowledge based systems. These problems have been analyzed and an effort have been made to find solutions for the problems related to earlier processes and models for supervised learning and knowledge based system development respectively. Initially this research work concentrates upon investigating past developments in the areas of data mining, supervised learning as well as software and knowledge engineering to find the past developments through literature survey. Then this research work concentrates upon enumerating different processing methods used for supervised learning. These different processes are devised through a general Knowledge Discovery in Databases (KDD) process. From this general process, six specific processes were identified to be widely practiced worldwide. After investigating different processes for supervised learning, limitations of these methods are identified. After identifying problems related to different supervised learning processes, investigations are directed towards finding the solutions for these problems. Search for finding solutions of these problems motivated to develop a new process for supervised learning. This motivation resulted in the form of a new process named as Fuzzy Boundaries of Regression Based Clusters process. Proposed process is based on parallel processing of classification as well as regression algorithms. Proposed process allows pruning the training data without compromising the performance of outcome. It reduces the size of the train set significantly, while analyzing each record of data qualitatively as well as quantitatively through classification and regression algorithms respectively. Proposed process as well as previously known processes are applied on twenty classification algorithms and five regression algorithms over ten datasets gathered from internationally renowned organizations.
Pagination: 167p.
URI: http://hdl.handle.net/10603/306579
Appears in Departments:School of Mathematics

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File118.94 kBAdobe PDFView/Open
02_dedication.pdf96.24 kBAdobe PDFView/Open
03_acknowledgements.pdf99.97 kBAdobe PDFView/Open
04_declaration.pdf293.71 kBAdobe PDFView/Open
05_abstract.pdf105.54 kBAdobe PDFView/Open
06_contents.pdf121.55 kBAdobe PDFView/Open
07_list of figures.pdf105.12 kBAdobe PDFView/Open
08_list of tables.pdf101.33 kBAdobe PDFView/Open
09_chapter 1.pdf137.95 kBAdobe PDFView/Open
10_chapter 2.pdf195.98 kBAdobe PDFView/Open
11_chapter 3.pdf208.78 kBAdobe PDFView/Open
12_chapter 4.pdf255.48 kBAdobe PDFView/Open
13_chapter 5.pdf339.8 kBAdobe PDFView/Open
14_chapter 6.pdf151.58 kBAdobe PDFView/Open
15_chapter 7.pdf1.77 MBAdobe PDFView/Open
16_references.pdf187.56 kBAdobe PDFView/Open
17_list of publications.pdf85.9 kBAdobe PDFView/Open
18_appendix a.pdf121.47 kBAdobe PDFView/Open
19_appendix b.pdf117.77 kBAdobe PDFView/Open
80_recommendation.pdf318.42 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: