Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/317973
Title: Novel Initial Seed Selection Methodology for Partitional Clustering Algorithms
Researcher: Sajidha,S A
Guide(s): Kalyani Desikan
Keywords: Computer Science
Computer Science Interdisciplinary Applications
Engineering and Technology
University: VIT University
Completed Date: 2020
Abstract: The underlying open issues in the partitional clustering algorithms such as K means newlineand K modes algorithms are as follows - random initial seed point selection, identifying the number of clusters, clustering tendency, handling empty clusters, identifying outliers and so on. Many authors have proposed different techniques to identify initial seed points which may involve setting of values for parameters, randomisation etc. This may not generate clustering solution having the minimal number of misclassifications. Thus, a clustering solution having high intra-cluster similarity and very low inter-cluster similarity cannot be assured. Also the same clustering solution which satisfies the above condition cannot be generated every time the clustering algorithm is executed. Therefore, it is important to identify the initial seed points which are representative points of the clusters of the final clustering solution. This ensures the final clustering solution to have high intra-cluster similarity and low inter-cluster similarity. Since the initial seed points are identified the final clustering solution can be regenerated everytime the clustering algorithm is executed. We have, hence, put forth a novel and simple methodology to overcome the problem of initial seed point selection which has a major impact on the final clustering solution. Our methodology ensures that the clustering solution is a repeatable one with minimal number of misclassifications compared to the existing newlineclustering techniques or generates the same clustering solution as that of the existing algorithms. This is not possible using K means clustering algorithm as the initial seeds are selected randomly during the clustering process. In K means clustering algorithm one needs to make all possible enumerations to find the clustering solution having minimal number of misclassifications. The proposed methodology overcomes this problem by selecting the seed points which are well separated from each other such that they fall into different clusters of the fina
Pagination: i-viii, 1-177
URI: http://hdl.handle.net/10603/317973
Appears in Departments:School of Computing Science and Engineering -VIT-Chennai

Files in This Item:
File Description SizeFormat 
01_ title page.pdfAttached File102.76 kBAdobe PDFView/Open
02_ signed copy of declaration_&_certificate.pdf81.13 kBAdobe PDFView/Open
03_ abstract.pdf57.89 kBAdobe PDFView/Open
04_content.pdf42.86 kBAdobe PDFView/Open
05_ list of tables.pdf61.88 kBAdobe PDFView/Open
06_ list of figures.pdf60.99 kBAdobe PDFView/Open
07_ acknowledgement.pdf40.84 kBAdobe PDFView/Open
08_ chapter-1.pdf364.02 kBAdobe PDFView/Open
09_ chapter-2.pdf195.76 kBAdobe PDFView/Open
10_ chapter-3.pdf451.55 kBAdobe PDFView/Open
11_ chapter-4.pdf308.12 kBAdobe PDFView/Open
12_ chapter-5.pdf403.73 kBAdobe PDFView/Open
13_ chapter-6.pdf81.55 kBAdobe PDFView/Open
14_ chapter-7.pdf49.51 kBAdobe PDFView/Open
15_ references.pdf73.43 kBAdobe PDFView/Open
16_ list of publications.pdf40.19 kBAdobe PDFView/Open
80_recommendation.pdf245.21 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: