Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/520023
Title: | Stream data analysis using advanced machine learning approaches |
Researcher: | Arun Manicka Raja, M |
Guide(s): | Swamynathan, S |
Keywords: | Computer Science Computer Science Information Systems Engineering and Technology Polarity oriented words Social networks Stream data |
University: | Anna University |
Completed Date: | 2022 |
Abstract: | Stream data analysis involves the process of identifying the potential patterns newlinefrom the data streams from various sources. Many of the data stream sources newlineare social media applications. Social networks evolved as a mandate for newlinesharing information instantly. Social networking applications are prominent newlineamong the internet user communities. Many social media websites are used newlinefor sharing the information instantly. newlineTwitter is one of the vibrant social networking websites for sharing newlinesmall textual information within a short span of time. It is essential to identify newlinethe type of information shared on these websites. The twitter stream data newlineanalysis mainly involves the sentiment analysis process using various trained newlinemachine learning classifiers applied on a large collection of tweets. The newlineclassifiers are trained using a maximum number of polarity oriented words for newlineeffectively classifying the tweets. The trained classifiers at sentence level newlineoutperformed the keyword based classification method. The classified tweets newlineare further analyzed for identifying the top tweets. The experimental results newlineshow that the sentiment analysis process predicted polarities of tweet and newlineeffectively identified the top tweets. In addition to the polarity prediction, newlinescore calculation of sentiment content on tweets is essential. newlineSentiment score calculation is carried out with sentiment corpus newlineapproach for calculating the score effectively. Especially, the grammatical newlinetype of the word used in a tweet and the relationship between the words are newlineproperly identified. The tweet tagger and corpus sentiment score assignment newlineis distinctively used when compared to other previous works. newline |
Pagination: | xxii,167p. |
URI: | http://hdl.handle.net/10603/520023 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 27.63 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 1.86 MB | Adobe PDF | View/Open | |
03_content.pdf | 53.24 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 115.26 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 291.78 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 175.01 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 118.88 kB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 329.15 kB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 1.21 MB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 817.03 kB | Adobe PDF | View/Open | |
11_chapter 7.pdf | 479.85 kB | Adobe PDF | View/Open | |
12_chapter 8.pdf | 415.1 kB | Adobe PDF | View/Open | |
13_annexures.pdf | 125.93 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 76.44 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: