JOURNAL ARTICLE

The Effect Of Chi-Square Feature Selection On The Naive Bayes Algorithm In Analyzing The Sentiment Of Gojek Application Reviews On Google Play Store

Rafael Handika DwinantaShinta Estri Wahyuningrum

Year: 2025 Journal:   Proxies Jurnal Informatika Vol: 9 (1)Pages: 23-29   Publisher: Soegijapranata Catholic University

Abstract

This study analyzes customer sentiment in reviewing the Gojek application to find out whether Chi-Square feature selection can improve the performance of the sentiment analysis model. This study uses 12,000 Gojek review data, starting with labeling positive, negative, or neutral based on user ratings of the reviews. Naive Bayes with and without Chi-Square feature selection is used in testing related to accuracy, precision, recall, and F1 score. The best performance is obtained by using alpha 0.5 combined with the best 2000 Chi-Square features, which produces 86.96% accuracy, 87.84% precision, 86.96% recall, and 85.29% F1 score on imbalanced data. SMOTE is also used to handle the low number of neutral reviews, but it produces lower accuracy. In conclusion, Chi-Square feature selection in the Naive Bayes algorithm can improve model accuracy on imbalanced and balanced datasets.

Keywords:

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.56
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Data Mining and Machine Learning Applications
Physical Sciences →  Computer Science →  Information Systems
Multimedia Learning Systems
Physical Sciences →  Computer Science →  Information Systems
Information Retrieval and Data Mining
Physical Sciences →  Computer Science →  Information Systems
© 2026 ScienceGate Book Chapters — All rights reserved.