Please use this identifier to cite or link to this item:
Title: Enhanced Multi Feature based Machine Learning Techniques for Identification of Online Spam Reviews
Researcher: Krishnaveni N
Guide(s): Radha V
Keywords: Engineering and Technology
Computer Science
Computer Science Interdisciplinary Applications
University: Avinashilingam Institute for Home Science and Higher Education for Women
Completed Date: 2021
Abstract: In any online e-commerce business, online reviews are powerful medium for expressing opinions, where the ratings and customer feedbacks are the centerpiece. In the past decade, spam reviews have trouble the e-commerce sector worldwide. These reviews can be used to control the sentiment of a product in a negative manner and therefore, are considered extremely harmful to online businesses and individuals alike. In order to detect and remove these harmful reviews, this research work proposes spam detection system that consists of two steps, namely, feature engineering and classification. newlineThe research methodology was designed in three phases, where the first phase focus on feature engineering, while the second and third phases focus on the classification step. Phase I (feature engineering) performs two tasks, namely, feature extraction and optimal feature vector construction. Task 1 extracts three types of features, namely, review centric features, reviewer centric features and product centric features. A total of 50 features were extracted. In order to handle the problem of high dimensionality, a feature selection algorithm that combined enhanced maximum relevant minimum redundant filter based algorithm with ant colony optimization algorithm combined with genetic algorithm, was proposed. newlinePhases II and III used this optimal feature vector. Phase II focused on designing an enhanced ensemble system. The base classifier used was support vector machine which was enhanced to improve its accuracy and reduce its time complexity. A hyperplane construction method that is based on Mahalanobis distance measure was proposed to improve the accuracy of the Support Vector Machine (SVM) classifier. Similarly, a speed optimization procedure that removed irrelevant support vectors was also proposed. The ensemble system was constructed by differing the kernel function used by the SVM classifier. In Phase III, hybrid spam review detection systems that combined clustering, classification and the enhanced ensemble system from Phase II was
Pagination: 208 p.
Appears in Departments:Department of Computer Science

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File4.61 kBAdobe PDFView/Open
02_certificate.pdf407.59 kBAdobe PDFView/Open
03_acknowledgement.pdf56.1 kBAdobe PDFView/Open
04_contents.pdf21.59 kBAdobe PDFView/Open
05_list of tables, fiures and abbreviations.pdf243.58 kBAdobe PDFView/Open
06_chapter 1.pdf1.41 MBAdobe PDFView/Open
07_chapter 2.pdf617.87 kBAdobe PDFView/Open
08_chapter 3.pdf304.52 kBAdobe PDFView/Open
09_chapter 4.pdf897.78 kBAdobe PDFView/Open
10_chapter 5.pdf520.57 kBAdobe PDFView/Open
11_chapter 6.pdf658.31 kBAdobe PDFView/Open
12_chapter 7.pdf746.5 kBAdobe PDFView/Open
13_chapter 8.pdf306.1 kBAdobe PDFView/Open
14_bibliography.pdf424.7 kBAdobe PDFView/Open
80_recommendation.pdf99.25 kBAdobe PDFView/Open
Show full item record

Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: