Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/520023
Title: Stream data analysis using advanced machine learning approaches
Researcher: Arun Manicka Raja, M
Guide(s): Swamynathan, S
Keywords: Computer Science
Computer Science Information Systems
Engineering and Technology
Polarity oriented words
Social networks
Stream data
University: Anna University
Completed Date: 2022
Abstract: Stream data analysis involves the process of identifying the potential patterns newlinefrom the data streams from various sources. Many of the data stream sources newlineare social media applications. Social networks evolved as a mandate for newlinesharing information instantly. Social networking applications are prominent newlineamong the internet user communities. Many social media websites are used newlinefor sharing the information instantly. newlineTwitter is one of the vibrant social networking websites for sharing newlinesmall textual information within a short span of time. It is essential to identify newlinethe type of information shared on these websites. The twitter stream data newlineanalysis mainly involves the sentiment analysis process using various trained newlinemachine learning classifiers applied on a large collection of tweets. The newlineclassifiers are trained using a maximum number of polarity oriented words for newlineeffectively classifying the tweets. The trained classifiers at sentence level newlineoutperformed the keyword based classification method. The classified tweets newlineare further analyzed for identifying the top tweets. The experimental results newlineshow that the sentiment analysis process predicted polarities of tweet and newlineeffectively identified the top tweets. In addition to the polarity prediction, newlinescore calculation of sentiment content on tweets is essential. newlineSentiment score calculation is carried out with sentiment corpus newlineapproach for calculating the score effectively. Especially, the grammatical newlinetype of the word used in a tweet and the relationship between the words are newlineproperly identified. The tweet tagger and corpus sentiment score assignment newlineis distinctively used when compared to other previous works. newline
Pagination: xxii,167p.
URI: http://hdl.handle.net/10603/520023
Appears in Departments:Faculty of Information and Communication Engineering

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File27.63 kBAdobe PDFView/Open
02_prelim pages.pdf1.86 MBAdobe PDFView/Open
03_content.pdf53.24 kBAdobe PDFView/Open
04_abstract.pdf115.26 kBAdobe PDFView/Open
05_chapter 1.pdf291.78 kBAdobe PDFView/Open
06_chapter 2.pdf175.01 kBAdobe PDFView/Open
07_chapter 3.pdf118.88 kBAdobe PDFView/Open
08_chapter 4.pdf329.15 kBAdobe PDFView/Open
09_chapter 5.pdf1.21 MBAdobe PDFView/Open
10_chapter 6.pdf817.03 kBAdobe PDFView/Open
11_chapter 7.pdf479.85 kBAdobe PDFView/Open
12_chapter 8.pdf415.1 kBAdobe PDFView/Open
13_annexures.pdf125.93 kBAdobe PDFView/Open
80_recommendation.pdf76.44 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: