Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/592597
Title: | Efficient machine learning models for handling class imbalance concept drift and dimensionality reduction in big data |
Researcher: | Aarthi R J |
Guide(s): | Vinayagasundaram B |
Keywords: | AutoEncoder Big data Machine Learning |
University: | Anna University |
Completed Date: | 2022 |
Abstract: | Big data revolution in various domains offers unprecedented newlineopportunities for insight and innovation. The key challenges that affect data newlinestorage and analysis are concept drift, Dimensionality Reduction, and Class newlineimbalance problems, which are significant and critically evaluated in this newlineresearch work. newlineThe accurate prediction of extreme weather occurrences is significantly newlinehindered by the class imbalance problem. Handling these issues with traditional newlinemachine learning models is found to be difficult. This laid the path to newlinedeveloping a hybrid of the AutoEncoder (AE), Synthetic Minority class newlineOversampling (SMOTE), and Extreme Learning Machine (ELM) newline(AE_SMOTE_ELM) method to perform binary classification on four newlineimbalanced climate data sets. The proposed method also mitigates the struggle newlineto learn from imbalanced datasets, as they tend to prioritize the majority class, newlineneglecting minority classes, thereby improving the predictive performance of newlinemodels on imbalanced data. This model, when compared with other state-ofthe- newlineart deep learning methods, shows better accuracy and efficiency of 90.4% newlineand 91.76%, respectively. newlineConventional machine learning methods encounter difficulties such as newlineincreased computational complexity, overfitting, and trouble interpreting the newlineresults when dealing with high-dimensional datasets. To overcome this issue, newlinethe proposed Dimensionally Reduced Optimal Skyline Evaluation (DROSE) newlinealgorithm improves stability and performance in a high-dimensional newlineenvironment. The k-dominant optimal skylines are detected with k-means newlineclustering of the DROSE newline |
Pagination: | xvii,144p. |
URI: | http://hdl.handle.net/10603/592597 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 234.21 kB | Adobe PDF | View/Open |
02_prelim_pages.pdf | 407.09 kB | Adobe PDF | View/Open | |
03_contents.pdf | 208.6 kB | Adobe PDF | View/Open | |
04_abstracts.pdf | 182.81 kB | Adobe PDF | View/Open | |
05_chapter1.pdf | 210.03 kB | Adobe PDF | View/Open | |
06_chapter2.pdf | 251.71 kB | Adobe PDF | View/Open | |
07_chapter3.pdf | 1.08 MB | Adobe PDF | View/Open | |
08_chapter4.pdf | 1.11 MB | Adobe PDF | View/Open | |
09_chapter5.pdf | 930.53 kB | Adobe PDF | View/Open | |
10_chapter6.pdf | 702.02 kB | Adobe PDF | View/Open | |
11_annexures.pdf | 92.9 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 89.33 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: