Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/565923
Title: Performance analysis on machinelearning techniques for sparse and densely distributed big data analytics
Researcher: Kalyana Saravanan A
Guide(s): Tamilarasi A
Keywords: Big Data Analytics
Data Mining
Machine Learning
University: Anna University
Completed Date: 2022
Abstract: Big data analytics is a strategy for estimate the enormous data. The newlineprocess of Big data analytics is determined to perceive the important data. In newlineorder to owing the big data analytics is difficult task in improved volume of newlinedata by using data mining. The huge data is managed by several machine newlinelearning techniques for identifying the valuable data from large dataset. newlineHowever, the dimensionality reduction was a difficult task. From data mining, newlineclustering method is obtained to reduce the dimensionality through grouping newlinesimilar type of data in dataset. In addition, classification is one of the newlineimportant data mining methods to categorize the data into relevant classes for newlinebig data analytics. Now a lot of clustering and classification techniques were newlinedetermined to utilize the big data in an important manner. However, it failed newlineto extend the accuracy and quality of data with minimum dimensionality. To newlineovercome the above such issues, three different techniques are developed in newlinethis research for improving accuracy of big data analytics with minimum time newlineand space complexity. newlineInitially, a novel technique is called as Proximity Fuzzy Likelihood newlineMaximization Data Clustering (PFLMDC) technique. The designed technique newlineis to improve the performance of clustering by using big data analytics. The newlineproposed technique is achieving both sparse and dense data clustering. From newlinePFLMDC technique, sparse data clustering is performed by computing newlineProximity Manhattan distance. This aids to set the similar sparse data into newlinecluster with better accuracy. After that, Fuzzy Expected Maximum Likelihood newlineEstimation is applied to set the dense data into separate cluster to minimize newlinethe dimensionality. newline
Pagination: xvi,187p.
URI: http://hdl.handle.net/10603/565923
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File82.08 kBAdobe PDFView/Open
02_prelimpages.pdf3.06 MBAdobe PDFView/Open
03_contents.pdf581.36 kBAdobe PDFView/Open
04_abstracts.pdf126.83 kBAdobe PDFView/Open
05_chapter1.pdf210.77 kBAdobe PDFView/Open
06_chapter2.pdf372.71 kBAdobe PDFView/Open
07_chapter3.pdf934.05 kBAdobe PDFView/Open
08_chapter4.pdf998.4 kBAdobe PDFView/Open
09_chapter5.pdf664.31 kBAdobe PDFView/Open
10_chapter6.pdf638.72 kBAdobe PDFView/Open
11_annexures.pdf109.92 kBAdobe PDFView/Open
80_recommendation.pdf93.98 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: