Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/464124
Title: | Design of sentiment analysis system based on multilingual social network data |
Researcher: | Garg, Neha |
Guide(s): | Sharma, Kamlesh |
Keywords: | Computer Science Computer Science Artificial Intelligence Engineering and Technology |
University: | Manav Rachna International Institute of Research and Studies |
Completed Date: | 2022 |
Abstract: | The usage of social sites is increasing among a huge variety of users like activists, spammers, trolls and commentators, etc. and the advancement of the World Wide Web (WWW) has enabled the users to post data in the structure of their choice (video, audio, text , etc.) and in the languages of their liking (English, Hindi, Spanish, French, German , etc.). With such flexibility, more users have started using social networking sites extensively. The users belong to different social and economic strata of the society and their academic credentials also vary to a large range. Due to the variation in types of users, the topics of interest on social networking sites vary from being dating, economy, environment, sports, academics, science, film, television, wildlife and politics. Language switching is also one of the aspects of such social media comments probably to address wider audience and to have a more impactful outcome. The opinions of different users provide feedback or information related to a particular field which is further used for Sentiment Analysis (SA). The main objective of the research is on mining tweets from Twitter for Hindi, English and Hinglish languages and performing the SA of the targeted events. The mining of sentiments in Hindi has its specific issues and challenges. Morphologically, Hindi is very rich and it is, order wise, very much a free language as compared to other languages such as English which decisively adds to complexity when user-generated content is handled. Limitation of resources for Hindi language invites new challenges which range from collection and generation of datasets. The objective of the research is to apply text preprocessing techniques on the collected dataset of Hindi, English and Hinglish languages and discuss their merit and demerits as well. The research also targets at building resources- annotated corpora and subjective lexicon for Hindi and Hinglish language. The corpus is annotated on the foundation of polarity of words into three categories i.e. negative, neut |
Pagination: | |
URI: | http://hdl.handle.net/10603/464124 |
Appears in Departments: | Department of Computer Science Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 323.73 kB | Adobe PDF | View/Open |
02_prelim.pdf | 379.51 kB | Adobe PDF | View/Open | |
03_content.pdf | 290.03 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 202.12 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 1.56 MB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 601.25 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 1.65 MB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 1.59 MB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 5.76 MB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 448.13 kB | Adobe PDF | View/Open | |
11_annexures.pdf | 4.93 MB | Adobe PDF | View/Open | |
80_recommendation.pdf | 456.17 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: