Unprejudiced Stemming Approach for  Disambiguation of Social Media Corpora  to Improve the Accuracy of Sentiment  Analysis  using Machine Learning

Akula V S Sivarama Rao

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/356364

Title:	Unprejudiced Stemming Approach for Disambiguation of Social Media Corpora to Improve the Accuracy of Sentiment Analysis using Machine Learning
Researcher:	Akula V S Sivarama Rao
Guide(s):	Ranjana, P
Keywords:	Computer Science Computer Science Artificial Intelligence Engineering and Technology
University:	Hindustan University
Completed Date:	2021
Abstract:	Big Data Analytics has emerged as a decision-centric approach for organizations newlineto uncover hidden patterns, correlations, market trends, and customer behavior. newlineWeb 2.0 textual data is one of the most popular sources of big data, and Web 2.0 newlinetechnologies generate huge social corpora from our daily lives. Natural Language newlineProcessing plays a vital role in Web 2.0 technology applications such as Internet newlinebusiness intelligence, reputation management, Sentiment Analysis, and opinion newlinemining. Natural Language Processing and Machine Learning are subfields of newlineArtificial Intelligence, which work together to solve big data analytical problems. newlineNatural Language Processing and Machine Learning can understand and analyze newlinethe natural language corpora on Social Media Networks and provide actionable newlinedata intelligence. Sentiment Analysis uses Natural Language Processing and newlineMachine Learning to extract insights from social corpora of a company, a newlinebusiness or service organization or government agency to improve the quality of newlineproducts, customer service, media perceptions, marketing strategies, sales, newlinecustomer retention, management reputation, trend analysis, new business newlineopportunities and crises management. According to the sentiment analysis survey, the challenges include bi-polar, NLPoverheads, newlinedomain dependence, negation, huge lexicon, world knowledge, newlineextracting features, spam-fake. Among these, the challenges of bipolar and NLPoverheads newlinehave the least analytical accuracy. Social Big Data contains newlineHomographs and Morphological ambiguities, which are the root cause of Bipolar newlineand NLP-overheads. The present research focuses on the data preparation phase newlineof Sentiment Analysis to improve the accuracy of Sentiment Analysis by disambiguating Homographs and Morphological terms and classifying the newlineDemographic attributes of the corpora. To disambiguate Homograph terms in social media corpora, we implemented newlineMachine Learning based on the Homograph Disambiguation algorithm and newlineachieved the state-of-the-art accuracy levels.
Pagination:
URI:	http://hdl.handle.net/10603/356364
Appears in Departments:	Department of Computer Science and Engineering

Files in This Item:

File	Description	Size	Format
10_material.pdf	Attached File	520.01 kB	Adobe PDF	View/Open
11_result.pdf		590.57 kB	Adobe PDF	View/Open
12_discussion.pdf		63.36 kB	Adobe PDF	View/Open
13_summary.pdf		28.42 kB	Adobe PDF	View/Open
15_references.pdf		81.08 kB	Adobe PDF	View/Open
1_title.pdf		23.27 kB	Adobe PDF	View/Open
2_certificate.pdf		231 kB	Adobe PDF	View/Open
3_declaration.pdf		22.53 kB	Adobe PDF	View/Open
4_ack.pdf		21.51 kB	Adobe PDF	View/Open
5_content.pdf		40.93 kB	Adobe PDF	View/Open
6_abstract.pdf		23.45 kB	Adobe PDF	View/Open
7_listoftables.pdf		42.21 kB	Adobe PDF	View/Open
80_recommendation.pdf		104.19 kB	Adobe PDF	View/Open
8_introduction.pdf		100.9 kB	Adobe PDF	View/Open
9_review.pdf		118.28 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET