Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/356364
Title: Unprejudiced Stemming Approach for Disambiguation of Social Media Corpora to Improve the Accuracy of Sentiment Analysis using Machine Learning
Researcher: Akula V S Sivarama Rao
Guide(s): Ranjana, P
Keywords: Computer Science
Computer Science Artificial Intelligence
Engineering and Technology
University: Hindustan University
Completed Date: 2021
Abstract: Big Data Analytics has emerged as a decision-centric approach for organizations to uncover hidden patterns, correlations, market trends, and customer behavior. Web 2.0 textual data is one of the most popular sources of big data, and Web 2.0 technologies generate huge social corpora from our daily lives. Natural Language Processing plays a vital role in Web 2.0 technology applications such as Internet business intelligence, reputation management, Sentiment Analysis, and opinion mining. Natural Language Processing and Machine Learning are subfields of Artificial Intelligence, which work together to solve big data analytical problems. Natural Language Processing and Machine Learning can understand and analyze the natural language corpora on Social Media Networks and provide actionable data intelligence. Sentiment Analysis uses Natural Language Processing and Machine Learning to extract insights from the social corpora of a company, a business or service organization, or a government agency to improve the quality of products, customer service, media perceptions, marketing strategies, sales, customer retention, management reputation, trend analysis, new business opportunities, and crisis management. According to the sentiment analysis survey, the challenges include bipolar, NLP-overheads, domain dependence, negation, huge lexicon, world knowledge, feature extraction, and spam-fake. Among these, the bipolar and NLP-overheads challenges have the least analytical accuracy. Social Big Data contains Homographs and Morphological ambiguities, which are the root cause of the bipolar and NLP-overheads challenges.
The present research focuses on the data preparation phase of Sentiment Analysis, improving its accuracy by disambiguating Homographs and Morphological terms and classifying the Demographic attributes of the corpora. To disambiguate Homograph terms in social media corpora, we implemented a Machine Learning-based Homograph Disambiguation algorithm and achieved state-of-the-art accuracy levels.
URI: http://hdl.handle.net/10603/356364
Appears in Departments: Department of Computer Science and Engineering

Files in This Item:
File                    Description     Size        Format
10_material.pdf         Attached File   520.01 kB   Adobe PDF
11_result.pdf                           590.57 kB   Adobe PDF
12_discussion.pdf                       63.36 kB    Adobe PDF
13_summary.pdf                          28.42 kB    Adobe PDF
15_references.pdf                       81.08 kB    Adobe PDF
1_title.pdf                             23.27 kB    Adobe PDF
2_certificate.pdf                       231 kB      Adobe PDF
3_declaration.pdf                       22.53 kB    Adobe PDF
4_ack.pdf                               21.51 kB    Adobe PDF
5_content.pdf                           40.93 kB    Adobe PDF
6_abstract.pdf                          23.45 kB    Adobe PDF
7_listoftables.pdf                      42.21 kB    Adobe PDF
80_recommendation.pdf                   104.19 kB   Adobe PDF
8_introduction.pdf                      100.9 kB    Adobe PDF
9_review.pdf                            118.28 kB   Adobe PDF


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
