Design of sentiment analysis system based on multilingual social network data

Garg, Neha

Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/464124

Title:	Design of sentiment analysis system based on multilingual social network data
Researcher:	Garg, Neha
Guide(s):	Sharma, Kamlesh
Keywords:	Computer Science Computer Science Artificial Intelligence Engineering and Technology
University:	Manav Rachna International Institute of Research and Studies
Completed Date:	2022
Abstract:	The usage of social sites is increasing among a huge variety of users like activists, spammers, trolls and commentators, etc. and the advancement of the World Wide Web (WWW) has enabled the users to post data in the structure of their choice (video, audio, text , etc.) and in the languages of their liking (English, Hindi, Spanish, French, German , etc.). With such flexibility, more users have started using social networking sites extensively. The users belong to different social and economic strata of the society and their academic credentials also vary to a large range. Due to the variation in types of users, the topics of interest on social networking sites vary from being dating, economy, environment, sports, academics, science, film, television, wildlife and politics. Language switching is also one of the aspects of such social media comments probably to address wider audience and to have a more impactful outcome. The opinions of different users provide feedback or information related to a particular field which is further used for Sentiment Analysis (SA). The main objective of the research is on mining tweets from Twitter for Hindi, English and Hinglish languages and performing the SA of the targeted events. The mining of sentiments in Hindi has its specific issues and challenges. Morphologically, Hindi is very rich and it is, order wise, very much a free language as compared to other languages such as English which decisively adds to complexity when user-generated content is handled. Limitation of resources for Hindi language invites new challenges which range from collection and generation of datasets. The objective of the research is to apply text preprocessing techniques on the collected dataset of Hindi, English and Hinglish languages and discuss their merit and demerits as well. The research also targets at building resources- annotated corpora and subjective lexicon for Hindi and Hinglish language. The corpus is annotated on the foundation of polarity of words into three categories i.e. negative, neut
Pagination:
URI:	http://hdl.handle.net/10603/464124
Appears in Departments:	Department of Computer Science Engineering

Files in This Item:

File	Description	Size	Format
01_title.pdf	Attached File	323.73 kB	Adobe PDF	View/Open
02_prelim.pdf		379.51 kB	Adobe PDF	View/Open
03_content.pdf		290.03 kB	Adobe PDF	View/Open
04_abstract.pdf		202.12 kB	Adobe PDF	View/Open
05_chapter 1.pdf		1.56 MB	Adobe PDF	View/Open
06_chapter 2.pdf		601.25 kB	Adobe PDF	View/Open
07_chapter 3.pdf		1.65 MB	Adobe PDF	View/Open
08_chapter 4.pdf		1.59 MB	Adobe PDF	View/Open
09_chapter 5.pdf		5.76 MB	Adobe PDF	View/Open
10_chapter 6.pdf		448.13 kB	Adobe PDF	View/Open
11_annexures.pdf		4.93 MB	Adobe PDF	View/Open
80_recommendation.pdf		456.17 kB	Adobe PDF	View/Open

Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge:

Shodhganga : a reservoir of Indian theses @ INFLIBNET