Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/187113
Title: Data Mining for Biological and Environmental Problems
Researcher: Pooja Shrivastava
Guide(s): Kavita
University: Jayoti Vidyapeeth Women s University
Completed Date: 
Abstract: Data Mining is a knowledge discovery from data and it treats as mining of newlineknowledge from large amount of data in every field. The algorithms are newlineimplemented using MATLAB and fuzzy logic tool box and results are evaluated newlinebased on performance parameter in both algorithms. After doing this research newlineexperiment results show that how k-means and fuzzy C means implemented on newlineprotein data set. In this research work we present the problem that show proteins are newlinehighly affiliated to each other. newlineFCM allows one piece of data to belong to two or more clusters. Results newlinebased on different clusters in both algorithms. K-means is the centroid based newlinetechnique. We are also compared k-means and FCM results in this research. newlineComparison results show that the k-means is better than FCM. With the help of this newlineresearch we can remove complexity from data sets in future. So the result shows that newlineproteins are close to each other and k-means algorithm remove data set complexity newlinewith high accuracy and less consuming time and found large sum of distance in newlineamong the statistics peak s association to FCM algorithm. Data mining techniques is newlinevery important in the analysis of real environmental data. Forest fire is important to newlinethe forest ecosystem. Only few research focus on the scientific data. It is very newlinedifficult task. In this research show that the comparison results using bagging, newlinestacking and random subspace algorithms taking place forest fire figures locate in to newlineWEKA statistics mining suite. Bagging, stacking and random subspace algorithms newlineare implemented using WEKA and experiments are behavior and consequences are newlineassessing based on performance. Finally, we compared performance of bagging, newlinestacking and random subspace algorithm. On the basis of experiments, we have newlinefound that using 20-fold cross validation then performance of the stacking with newlinedecision stump and decision table improve the prediction accuracy of classifier. So newlinestacking is better and straightforward to interpret other. Stacking algorithm built newlineaccurate classifier model a
Pagination: 
URI: http://hdl.handle.net/10603/187113
Appears in Departments:Department of Computer Science and Information Communication Technology

Files in This Item:
File Description SizeFormat 
abbriviations.pdfAttached File48.99 kBAdobe PDFView/Open
abstarct.pdf56.38 kBAdobe PDFView/Open
acknowledgement.pdf62.26 kBAdobe PDFView/Open
appendices.pdf155.15 kBAdobe PDFView/Open
bibliogaphy.pdf124.52 kBAdobe PDFView/Open
certificate.pdf120.14 kBAdobe PDFView/Open
chap_1.pdf1.07 MBAdobe PDFView/Open
chap_2.pdf136.96 kBAdobe PDFView/Open
chap_3.pdf190.36 kBAdobe PDFView/Open
chap_4.pdf2.95 MBAdobe PDFView/Open
chap_5.pdf92.32 kBAdobe PDFView/Open
contents.pdf94.65 kBAdobe PDFView/Open
dedication.pdf1.34 MBAdobe PDFView/Open
title page.pdf5.92 MBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: