Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/324563
Title: Studies on Design of Ensembles for Efficient Learning of Diabetes Dataset
Researcher: LAVANYA, T
Guide(s): KUMARAVEL, A
Keywords: Computer Science
Computer Science Theory and Methods
Engineering and Technology
University: Bharath University
Completed Date: 2017
Abstract: Even though there are many factors influencing the final results of the predicting exercises, we should be very careful enough to select the list of most important ones especially in the context like diagnosing diabetes diseases. We focus on dataset size, cost sensitiveness, regional influence, and prioritization of the features. Mining the data sets of different sizes or different regions many times need not yield similar results with expected maximum accuracy. Hence the data size or inherent regional characteristics act as important parameters for mining exercises. In this research studies firstly we consider data sets from two different geographical regions and the calculation of performance measures separately. Also, we get the same for integrated data set obtained by the union of the original sets independently as inverse results establishing the hypothesis for integrated data set. Secondly we consider the issue of mechanizing the prediction of new patients heart disease diagnosis based on data mining on historical data is an extremely useful tool. newlineResearch in ensemble methods has largely revolved around designing ensembles consisting of competent yet complementary models. Ensemble Methods began about fifteen years ago as a separate research area within machine learning and were motivated by the idea of wanting to leverage the power of multiple models and not just trust one model built on a small training set. Significant theoretical and experimental developments have occurred over the past fifteen years and have led to several methods, especially bagging and boosting, being used to solve many real problems. However, ensemble methods also appear to be applicable to current and upcoming problems of distributed data mining, online applications, and others. Therefore, practitioners in data mining should stay tuned for further developments in the vibrant area of ensemble methods. We apply this type of ensemble design for the critical data set like that for Diabetes. newline newline
Pagination: 
URI: http://hdl.handle.net/10603/324563
Appears in Departments:Department of Computer Science and Engineering

Files in This Item:
File Description SizeFormat 
80_recommendation.pdfAttached File559.7 kBAdobe PDFView/Open
certificate.pdf101.38 kBAdobe PDFView/Open
chapter 1.pdf205.58 kBAdobe PDFView/Open
chapter 2.pdf234.59 kBAdobe PDFView/Open
chapter 3.pdf730.9 kBAdobe PDFView/Open
chapter 4.pdf1.03 MBAdobe PDFView/Open
chapter 5.pdf314.76 kBAdobe PDFView/Open
chapter 6.pdf283.96 kBAdobe PDFView/Open
preliminary pages.pdf422.53 kBAdobe PDFView/Open
title page.pdf289.84 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: