Boosting Accuracy of Fake Review Prediction Using Synthetic Minority Oversampling Technique

Bhawna Saxena; Shruti Goyal; Anjali Kumari; Anushka Agarwal

doi:10.1109/icccis56430.2022.10037717

ScienceGate Book Chapters

JOURNAL ARTICLE

Boosting Accuracy of Fake Review Prediction Using Synthetic Minority Oversampling Technique

Bhawna Saxena Shruti Goyal Anjali Kumari Anushka Agarwal

Year: 2022 Journal: 2022 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS) Vol: 9 Pages: 156-161

DOI: 10.1109/icccis56430.2022.10037717

Get Full-Text PDF Get Analytical Report

Abstract

In recent times prior to making a purchase, the vast majority read reviews about that product, and their decision is largely driven by the reviews. Deceitful online sellers often gather fake or spam reviews for their products or services, thereby reducing the effectiveness of online reviews. The review data is often imbalanced such that the fake reviews greatly outnumber the genuine reviews. An imbalance leads to a bias, as the model tends to mostly predict the majority class. To attain a high-quality classification outcome, the issue of imbalanced data should be resolved before applying the classification algorithms. This paper studies the performance of supervised machine learning classifiers pertaining to fake review detection. The approach put forward in this paper aims to improve the prediction accuracy of popular supervised learning classifiers Random - Forest, LightGBM, XGBoost, Naive Bayes, and Decision Tree on an imbalanced review dataset For boosting the accuracy of these classifiers, the Synthetic Minority Oversampling Technique is used for addressing the class imbalance problem. The performance of the classifiers has been studied by changing the oversampling parameters. The application of SMOTE showed a significant improvement in the classifier's prediction accuracy.

Keywords:

Oversampling Machine learning Boosting (machine learning) Random forest Artificial intelligence Computer science Naive Bayes classifier Decision tree Classifier (UML) Random subspace method Support vector machine Data mining

Metrics

Cited By

0.50

FWCI (Field Weighted Citation Impact)

Refs

0.65

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Spam and Phishing Detection

Physical Sciences → Computer Science → Information Systems

Imbalanced Data Classification Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Malware Detection Techniques

Physical Sciences → Computer Science → Signal Processing

Boosting Accuracy of Fake Review Prediction Using Synthetic Minority Oversampling Technique

Abstract

Metrics

Citation History

Topics

Related Documents

Customer Churn Prediction Using Synthetic Minority Oversampling Technique

Improving the prediction accuracy in blended learning environment using synthetic minority oversampling technique

Synthetic Minority Oversampling Technique (SMOTE) for Boosting the Accuracy of C4.5 Algorithm Model

Machine Learning Based Hospital Mortality Prediction Using Synthetic Minority Oversampling Technique

Heart Disease Prediction using Synthetic Minority Oversampling Technique and Soft Voting