Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/356364
Title: | Unprejudiced Stemming Approach for Disambiguation of Social Media Corpora to Improve the Accuracy of Sentiment Analysis using Machine Learning |
Researcher: | Akula V S Sivarama Rao |
Guide(s): | Ranjana, P |
Keywords: | Computer Science Computer Science Artificial Intelligence Engineering and Technology |
University: | Hindustan University |
Completed Date: | 2021 |
Abstract: | Big Data Analytics has emerged as a decision-centric approach for organizations newlineto uncover hidden patterns, correlations, market trends, and customer behavior. newlineWeb 2.0 textual data is one of the most popular sources of big data, and Web 2.0 newlinetechnologies generate huge social corpora from our daily lives. Natural Language newlineProcessing plays a vital role in Web 2.0 technology applications such as Internet newlinebusiness intelligence, reputation management, Sentiment Analysis, and opinion newlinemining. Natural Language Processing and Machine Learning are subfields of newlineArtificial Intelligence, which work together to solve big data analytical problems. newlineNatural Language Processing and Machine Learning can understand and analyze newlinethe natural language corpora on Social Media Networks and provide actionable newlinedata intelligence. Sentiment Analysis uses Natural Language Processing and newlineMachine Learning to extract insights from social corpora of a company, a newlinebusiness or service organization or government agency to improve the quality of newlineproducts, customer service, media perceptions, marketing strategies, sales, newlinecustomer retention, management reputation, trend analysis, new business newlineopportunities and crises management. According to the sentiment analysis survey, the challenges include bi-polar, NLPoverheads, newlinedomain dependence, negation, huge lexicon, world knowledge, newlineextracting features, spam-fake. Among these, the challenges of bipolar and NLPoverheads newlinehave the least analytical accuracy. Social Big Data contains newlineHomographs and Morphological ambiguities, which are the root cause of Bipolar newlineand NLP-overheads. The present research focuses on the data preparation phase newlineof Sentiment Analysis to improve the accuracy of Sentiment Analysis by disambiguating Homographs and Morphological terms and classifying the newlineDemographic attributes of the corpora. To disambiguate Homograph terms in social media corpora, we implemented newlineMachine Learning based on the Homograph Disambiguation algorithm and newlineachieved the state-of-the-art accuracy levels. |
Pagination: | |
URI: | http://hdl.handle.net/10603/356364 |
Appears in Departments: | Department of Computer Science and Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
10_material.pdf | Attached File | 520.01 kB | Adobe PDF | View/Open |
11_result.pdf | 590.57 kB | Adobe PDF | View/Open | |
12_discussion.pdf | 63.36 kB | Adobe PDF | View/Open | |
13_summary.pdf | 28.42 kB | Adobe PDF | View/Open | |
15_references.pdf | 81.08 kB | Adobe PDF | View/Open | |
1_title.pdf | 23.27 kB | Adobe PDF | View/Open | |
2_certificate.pdf | 231 kB | Adobe PDF | View/Open | |
3_declaration.pdf | 22.53 kB | Adobe PDF | View/Open | |
4_ack.pdf | 21.51 kB | Adobe PDF | View/Open | |
5_content.pdf | 40.93 kB | Adobe PDF | View/Open | |
6_abstract.pdf | 23.45 kB | Adobe PDF | View/Open | |
7_listoftables.pdf | 42.21 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 104.19 kB | Adobe PDF | View/Open | |
8_introduction.pdf | 100.9 kB | Adobe PDF | View/Open | |
9_review.pdf | 118.28 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: