Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/253226
Title: An optimized clustering technique for higher dimensional data
Researcher: Banumathy D
Guide(s): Selvarajan S
Keywords: Data Mining
Engineering and Technology,Computer Science,Computer Science Theory and Methods
Optimized Clustering
Optimized Clustering Technique
University: Anna University
Completed Date: 2018
Abstract: Data mining in databases is the automatic extraction of implicit and interesting patterns from large data collections. Data mining is a field at the intersection of computer science and statistics and is the process that attempts newlineto discover patterns in large data sets. It utilizes methods at the intersection of newlineartificial intelligence, machine learning, statistics and database systems. newlineClustering is a technique in data mining which deals with huge amount of newlinedata. Clustering is intended to help a user in discovering and understanding newlinethe natural structure in a data set and abstract the meaning of large dataset. It newlineis the task of partitioning objects of a data set into distinct groups such that newlinetwo objects from one cluster are similar to each other, whereas two objects newlinefrom distinct clusters are dissimilar. newlineMany situations have been adapted in the issue of clustering high newlinedimensional data. By computation of similarity measure between the data newlinepoints of a different class of data points while clustering data points with a newlinesmall size can be done in an easier way. As the size of dimension becomes newlinebigger, the similarity measure between data point also becomes difficult and newlinethus the issues also becomes tougher. A false indexing ratio is introduces and newlinewith the multiple class names, same data point may be assigned, where newlinecomputation of similarity between data points should be done considering all newlinethe dimensional values. In this work, proposed the Fuzzy C Means (FCM) newlineclustering, Fast clustering-based feature Selection algorithm (FAST) and newlinehubness in clustering for high dimensional data. By the usage of graphtheoretic newlineclustering, the features are spit into clusters. newline newline
Pagination: xx, 140p.
URI: http://hdl.handle.net/10603/253226
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File8.17 kBAdobe PDFView/Open
02_certificate.pdf1.02 MBAdobe PDFView/Open
03_abstract.pdf50.52 kBAdobe PDFView/Open
04_acknowledgment.pdf77.99 kBAdobe PDFView/Open
05_contents.pdf154.55 kBAdobe PDFView/Open
06_chapter1.pdf220.54 kBAdobe PDFView/Open
07_chapter2.pdf169.66 kBAdobe PDFView/Open
08_chapter3.pdf322.61 kBAdobe PDFView/Open
09_chapter4.pdf231.86 kBAdobe PDFView/Open
10_chapter5.pdf303.08 kBAdobe PDFView/Open
11_conclusion.pdf57.15 kBAdobe PDFView/Open
12_references.pdf154.42 kBAdobe PDFView/Open
13_publications.pdf114.15 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: