Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/253226
Title: | An optimized clustering technique for higher dimensional data |
Researcher: | Banumathy D |
Guide(s): | Selvarajan S |
Keywords: | Data Mining Engineering and Technology,Computer Science,Computer Science Theory and Methods Optimized Clustering Optimized Clustering Technique |
University: | Anna University |
Completed Date: | 2018 |
Abstract: | Data mining in databases is the automatic extraction of implicit and interesting patterns from large data collections. Data mining is a field at the intersection of computer science and statistics and is the process that attempts newlineto discover patterns in large data sets. It utilizes methods at the intersection of newlineartificial intelligence, machine learning, statistics and database systems. newlineClustering is a technique in data mining which deals with huge amount of newlinedata. Clustering is intended to help a user in discovering and understanding newlinethe natural structure in a data set and abstract the meaning of large dataset. It newlineis the task of partitioning objects of a data set into distinct groups such that newlinetwo objects from one cluster are similar to each other, whereas two objects newlinefrom distinct clusters are dissimilar. newlineMany situations have been adapted in the issue of clustering high newlinedimensional data. By computation of similarity measure between the data newlinepoints of a different class of data points while clustering data points with a newlinesmall size can be done in an easier way. As the size of dimension becomes newlinebigger, the similarity measure between data point also becomes difficult and newlinethus the issues also becomes tougher. A false indexing ratio is introduces and newlinewith the multiple class names, same data point may be assigned, where newlinecomputation of similarity between data points should be done considering all newlinethe dimensional values. In this work, proposed the Fuzzy C Means (FCM) newlineclustering, Fast clustering-based feature Selection algorithm (FAST) and newlinehubness in clustering for high dimensional data. By the usage of graphtheoretic newlineclustering, the features are spit into clusters. newline newline |
Pagination: | xx, 140p. |
URI: | http://hdl.handle.net/10603/253226 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 8.17 kB | Adobe PDF | View/Open |
02_certificate.pdf | 1.02 MB | Adobe PDF | View/Open | |
03_abstract.pdf | 50.52 kB | Adobe PDF | View/Open | |
04_acknowledgment.pdf | 77.99 kB | Adobe PDF | View/Open | |
05_contents.pdf | 154.55 kB | Adobe PDF | View/Open | |
06_chapter1.pdf | 220.54 kB | Adobe PDF | View/Open | |
07_chapter2.pdf | 169.66 kB | Adobe PDF | View/Open | |
08_chapter3.pdf | 322.61 kB | Adobe PDF | View/Open | |
09_chapter4.pdf | 231.86 kB | Adobe PDF | View/Open | |
10_chapter5.pdf | 303.08 kB | Adobe PDF | View/Open | |
11_conclusion.pdf | 57.15 kB | Adobe PDF | View/Open | |
12_references.pdf | 154.42 kB | Adobe PDF | View/Open | |
13_publications.pdf | 114.15 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: