Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/366164
Title: | Enhanced Multi Feature based Machine Learning Techniques for Identification of Online Spam Reviews |
Researcher: | Krishnaveni N |
Guide(s): | Radha V |
Keywords: | Engineering and Technology Computer Science Computer Science Interdisciplinary Applications |
University: | Avinashilingam Institute for Home Science and Higher Education for Women |
Completed Date: | 2021 |
Abstract: | In any online e-commerce business, online reviews are powerful medium for expressing opinions, where the ratings and customer feedbacks are the centerpiece. In the past decade, spam reviews have trouble the e-commerce sector worldwide. These reviews can be used to control the sentiment of a product in a negative manner and therefore, are considered extremely harmful to online businesses and individuals alike. In order to detect and remove these harmful reviews, this research work proposes spam detection system that consists of two steps, namely, feature engineering and classification. newlineThe research methodology was designed in three phases, where the first phase focus on feature engineering, while the second and third phases focus on the classification step. Phase I (feature engineering) performs two tasks, namely, feature extraction and optimal feature vector construction. Task 1 extracts three types of features, namely, review centric features, reviewer centric features and product centric features. A total of 50 features were extracted. In order to handle the problem of high dimensionality, a feature selection algorithm that combined enhanced maximum relevant minimum redundant filter based algorithm with ant colony optimization algorithm combined with genetic algorithm, was proposed. newlinePhases II and III used this optimal feature vector. Phase II focused on designing an enhanced ensemble system. The base classifier used was support vector machine which was enhanced to improve its accuracy and reduce its time complexity. A hyperplane construction method that is based on Mahalanobis distance measure was proposed to improve the accuracy of the Support Vector Machine (SVM) classifier. Similarly, a speed optimization procedure that removed irrelevant support vectors was also proposed. The ensemble system was constructed by differing the kernel function used by the SVM classifier. In Phase III, hybrid spam review detection systems that combined clustering, classification and the enhanced ensemble system from Phase II was |
Pagination: | 208 p. |
URI: | http://hdl.handle.net/10603/366164 |
Appears in Departments: | Department of Computer Science |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 4.61 kB | Adobe PDF | View/Open |
02_certificate.pdf | 407.59 kB | Adobe PDF | View/Open | |
03_acknowledgement.pdf | 56.1 kB | Adobe PDF | View/Open | |
04_contents.pdf | 21.59 kB | Adobe PDF | View/Open | |
05_list of tables, fiures and abbreviations.pdf | 243.58 kB | Adobe PDF | View/Open | |
06_chapter 1.pdf | 1.41 MB | Adobe PDF | View/Open | |
07_chapter 2.pdf | 617.87 kB | Adobe PDF | View/Open | |
08_chapter 3.pdf | 304.52 kB | Adobe PDF | View/Open | |
09_chapter 4.pdf | 897.78 kB | Adobe PDF | View/Open | |
10_chapter 5.pdf | 520.57 kB | Adobe PDF | View/Open | |
11_chapter 6.pdf | 658.31 kB | Adobe PDF | View/Open | |
12_chapter 7.pdf | 746.5 kB | Adobe PDF | View/Open | |
13_chapter 8.pdf | 306.1 kB | Adobe PDF | View/Open | |
14_bibliography.pdf | 424.7 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 99.25 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: