Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/577142
Title: | A neural network framework for enhancing sentiment analysis |
Researcher: | Sharma,Harshit |
Guide(s): | Ajmera, Reema |
Keywords: | Computer Science Computer Science Software Engineering Engineering and Technology |
University: | Nirwan University Jaipur |
Completed Date: | 2024 |
Abstract: | This research presents a thorough design and assessment of various machine learning models newlineapplied to sentiment analysis of Twitter data. The primary focus was to assess and compare the newlineeffectiveness of different classifiers, namely Logistic Regression, Linear SVM, Random Forest, newlineDecision Tree, XG Boost, and Naive Bayes, in categorizing tweets into positive and negative newlinesentiments. The evaluation criteria included metrics such as accuracy, precision, recall, F1-score, newlineand support, with a particular emphasis on both training and testing data accuracies. Key findings newlinerevealed that Logistic Regression and Linear SVM models exhibited high accuracy and balanced newlineperformance across different metrics, making them particularly suitable for sentiment analysis newlinetasks. In contrast, Random Forest and Decision Tree models, despite their high training newlineaccuracies, showed lower testing accuracies, indicating potential overfitting issues. XG Boost and newlineNaive Bayes models demonstrated moderate accuracy levels with consistent performance, newlineoffering a viable alternative for sentiment classification tasks. An essential aspect of the study newlineinvolved data analysis and visualization, focusing on the distribution of sentiments within the newlinetweets. This preliminary step informed the subsequent processes of feature extraction and newlineclassification. The use of word clouds for feature extraction and support vector machine-based newlineclassifiers for sentiment classification underscored the effectiveness of these methods in deriving newlineinsights from the data. The research utilized the sentiment140 dataset, consisting of 1.6 million newlinetweets, which were labeled for sentiment based on the presence of emoticons. This approach newlinehighlighted the innovative use of automated sentiment labeling in large datasets. The study also newlineunderscored the importance of balanced sentiment label distribution in training data for optimal newlinemodel performance. The findings lay a foundation for future research, emphasizing the adaptation newlineof these models to specific analytical conte |
Pagination: | 155pg. |
URI: | http://hdl.handle.net/10603/577142 |
Appears in Departments: | Department of Computer Science |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_titile.pdf | Attached File | 205.05 kB | Adobe PDF | View/Open |
02_prelim page.pdf | 279.25 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 269.14 kB | Adobe PDF | View/Open | |
05_chapter.1.pdf | 771.75 kB | Adobe PDF | View/Open | |
06_chapter.2.pdf | 483.32 kB | Adobe PDF | View/Open | |
07_chapter.3.pdf | 461.02 kB | Adobe PDF | View/Open | |
08_chapter.4.pdf | 1.87 MB | Adobe PDF | View/Open | |
09_chapter.5.pdf | 2.29 MB | Adobe PDF | View/Open | |
10_chapter.6.pdf | 363.13 kB | Adobe PDF | View/Open | |
11_annexures.pdf | 358.42 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 566.17 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: