Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/519875
Title: | Certain investigation for text classification on fake news data using optimized deep learning techniques |
Researcher: | Nithya, K |
Guide(s): | Krishnamoorthi, M and Sampth, P |
Keywords: | Computer Science Computer Science Information Systems Deep learning Engineering and Technology Fake news data Text classification |
University: | Anna University |
Completed Date: | 2023 |
Abstract: | In social media platforms, numerous data are generated every second newlineand the users of these platforms share the contents instantly when they like to newlineshare. The contents posted in social media are mostly unverified and there is a newlinehigh possibility of the presence of inaccurate information. Spread of fake newlineinformation on social media misleads the users about some situations or newlinetopics in political and social sectors. Rumour data classification in social newlinemedia seems to be an area of recent research due to its dependency on digital newlinecommunications, and this rumour data make social media unstable. newlineIn the field of Natural Language Processing (NLP), research on rumour newlinedetection has its challenges with word embedding, feature extraction and newlineclassification tasks. Previously, many researches have been conducted, but the newlineaccuracy of rumour detection is a big issue for identifying infrequent/rare newlinewords. The traditional word embedding models like Word2Vec, FastText and newlineso on., perform word level analysis for finding word vectors. In recent days, newlinethese models are replaced with the novel transformer-based BERT model newlinewhich considers the combination of word pieces for producing efficient newlinefeature vectors. Still, it is difficult to identify which word piece of each token newlineis to be used for vector formation. In the first part of the research, an automated fake news detection system is introduced with three different word embedding techniques such as Word2Vec, GloVe and FastText. Further, it is classified by using two newlineproposed algorithms namely Optimized Cost Sensitive Long Short-Term newlineMemory (OC-LSTM) and Optimized Cost Sensitive Gated Recurrent Unit newline(OC-GRU). For the LIAR and Fake and Real News (ISOT) datasets, the newlineFastText embedding with OC-GRU model yields improved accuracy. newlineTo improve the quality and accuracy of features, in the second part of newlinethe research, two phases of feature extraction techniques are involved for newlineperforming word embedding and deep feature extraction. In the first phase, newlineBidirectional Encoder Representations from Tran |
Pagination: | xix,129p. |
URI: | http://hdl.handle.net/10603/519875 |
Appears in Departments: | Faculty of Information and Communication Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 13.24 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 2.54 MB | Adobe PDF | View/Open | |
03_content.pdf | 625.3 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 257.79 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 482.23 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 443.27 kB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 838.49 kB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 1.06 MB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 1.08 MB | Adobe PDF | View/Open | |
10_chapter 6.pdf | 449.33 kB | Adobe PDF | View/Open | |
11_annexures.pdf | 1.91 MB | Adobe PDF | View/Open | |
80_recommendation.pdf | 43.31 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: