An optimized clustering technique for higher dimensional data

Banumathy D

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/253226

Title:	An optimized clustering technique for higher dimensional data
Researcher:	Banumathy D
Guide(s):	Selvarajan S
Keywords:	Data Mining Engineering and Technology,Computer Science,Computer Science Theory and Methods Optimized Clustering Optimized Clustering Technique
University:	Anna University
Completed Date:	2018
Abstract:	Data mining in databases is the automatic extraction of implicit and interesting patterns from large data collections. Data mining is a field at the intersection of computer science and statistics and is the process that attempts newlineto discover patterns in large data sets. It utilizes methods at the intersection of newlineartificial intelligence, machine learning, statistics and database systems. newlineClustering is a technique in data mining which deals with huge amount of newlinedata. Clustering is intended to help a user in discovering and understanding newlinethe natural structure in a data set and abstract the meaning of large dataset. It newlineis the task of partitioning objects of a data set into distinct groups such that newlinetwo objects from one cluster are similar to each other, whereas two objects newlinefrom distinct clusters are dissimilar. newlineMany situations have been adapted in the issue of clustering high newlinedimensional data. By computation of similarity measure between the data newlinepoints of a different class of data points while clustering data points with a newlinesmall size can be done in an easier way. As the size of dimension becomes newlinebigger, the similarity measure between data point also becomes difficult and newlinethus the issues also becomes tougher. A false indexing ratio is introduces and newlinewith the multiple class names, same data point may be assigned, where newlinecomputation of similarity between data points should be done considering all newlinethe dimensional values. In this work, proposed the Fuzzy C Means (FCM) newlineclustering, Fast clustering-based feature Selection algorithm (FAST) and newlinehubness in clustering for high dimensional data. By the usage of graphtheoretic newlineclustering, the features are spit into clusters. newline newline
Pagination:	xx, 140p.
URI:	http://hdl.handle.net/10603/253226
Appears in Departments:	Faculty of Information and Communication Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	8.17 kB	Adobe PDF	View/Open
02_certificate.pdf		1.02 MB	Adobe PDF	View/Open
03_abstract.pdf		50.52 kB	Adobe PDF	View/Open
04_acknowledgment.pdf		77.99 kB	Adobe PDF	View/Open
05_contents.pdf		154.55 kB	Adobe PDF	View/Open
06_chapter1.pdf		220.54 kB	Adobe PDF	View/Open
07_chapter2.pdf		169.66 kB	Adobe PDF	View/Open
08_chapter3.pdf		322.61 kB	Adobe PDF	View/Open
09_chapter4.pdf		231.86 kB	Adobe PDF	View/Open
10_chapter5.pdf		303.08 kB	Adobe PDF	View/Open
11_conclusion.pdf		57.15 kB	Adobe PDF	View/Open
12_references.pdf		154.42 kB	Adobe PDF	View/Open
13_publications.pdf		114.15 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET