Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/586701
Title: | Investigation of Fake Profiles in Social Media Content Using Natural Language Processing |
Researcher: | Kadam, Nitika |
Guide(s): | Sharma, Sanjeev Kumar |
Keywords: | Computer Science Engineering and Technology Fake Profile Imaging Science and Photographic Technology Machine Learning |
University: | Oriental University |
Completed Date: | 2024 |
Abstract: | Data mining and machine learning employed on huge data help to recover the target patterns for the applications of prediction, classification, and pattern recognition. The machine learning technique is the best fit for the binary classification problems. Thus, the proposed work introducesa new multi-model feature classification system for fake profile identification on social media. The work is conducted in two main parts, the first part includes the dataset collection and refinement of different profile attributes. The second part includes, merging of profile attributes and profile content using data profiling. In the dataset collection and refinement phase, dataset is used as the profile attributes and thereafter data includes the contents published by the user. Thereafter, data features are reduced and only essential features are used for experimentation. In the next phase, the extracted published contents are used with the Natural Language Processing (NLP) parser for transforming the data into the vector format. Thereafter, profile attribute and profile content are combined using the data profiling. The profiled data is further used with an improved Artificial Neural Network (ANN) to learn and classify the data. An algorithm is proposed to compare two popular machine learning models i.e., Support Vector Machine (SVM) and ANN to classify the data. The decision scores are finally generated based on classification outcomes. All the experiments have been done using real world twitter published dataset of user profiles which demonstrate and helps in the identification of fake user profiles. Finally, the performance and accuracy of proposed model has been analyzed using certain accuracy matrices such as Precision, Recall, F-Measures. newline |
Pagination: | 99 |
URI: | http://hdl.handle.net/10603/586701 |
Appears in Departments: | Computer Science & Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_ title.pdf | Attached File | 741.54 kB | Adobe PDF | View/Open |
02_prelim-pages.pdf | 1.42 MB | Adobe PDF | View/Open | |
03_content-pages.pdf | 870.45 kB | Adobe PDF | View/Open | |
04_abstract.pdf | 735.7 kB | Adobe PDF | View/Open | |
05_chapter 1.pdf | 890.36 kB | Adobe PDF | View/Open | |
06_chapter 2.pdf | 1.81 MB | Adobe PDF | View/Open | |
07_chapter 3.pdf | 1.63 MB | Adobe PDF | View/Open | |
08_chapter 4.pdf | 1.49 MB | Adobe PDF | View/Open | |
09_chapter 5.pdf | 862.18 kB | Adobe PDF | View/Open | |
10_annexure.pdf | 1.61 MB | Adobe PDF | View/Open | |
80_recommendation.pdf | 741.54 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: