JOURNAL ARTICLE

Sentiment Analysis of Movie Review using Naïve Bayes Method with Gini Index Feature Selection

Abstract

Movie reviews contain information that indicates whether a movie is good or bad. Sentiment analysis processes this information to determine the polarity of a sentence. Because reviews are unstructured and have a large number of attributes, classification requires considerable time and computational resources. Feature selection addresses this problem by reducing dimensionality, which speeds up classification and reduces misclassification. Gini Index Text feature selection was originally applied to document classification, where it successfully enhanced classifier performance. Multinomial Naïve Bayes (MNNB) is a popular classifier for document classification; the open question is whether Gini Index Text feature selection can also improve MNNB classification performance. This study therefore uses the Gini Index Text (GIT) for text feature selection with an MNNB classifier to classify movie reviews into positive and negative classes. The data come from the IMDB dataset, which contains reviews written in English, and are divided into two parts: 90% training data and 10% test data. The test results show that the Gini index as a feature selection method increases accuracy: accuracy is 56% without feature selection and 59.54% with feature selection, an increase of 3.54 percentage points.
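The pipeline described in the abstract (score terms, keep the highest-scoring ones, then train a multinomial Naïve Bayes classifier on the reduced vocabulary) can be illustrated with a minimal sketch. The abstract does not give the scoring formula, so this example assumes one commonly cited form of the Gini Index for text, score(t) = Σ_c P(t|c)² · P(c|t)², estimated from document frequencies. The function names, the toy reviews, the cutoff k = 3, and the Laplace smoothing value are all hypothetical and are not taken from the study.

```python
import math
from collections import Counter

def gini_text_scores(docs, labels):
    """Assumed scoring: sum over classes of P(t|c)^2 * P(c|t)^2 (document frequencies)."""
    classes = sorted(set(labels))
    docs_per_class = Counter(labels)
    df = {c: Counter() for c in classes}  # df[c][t] = docs of class c containing t
    for tokens, c in zip(docs, labels):
        df[c].update(set(tokens))
    vocab = set().union(*df.values())
    scores = {}
    for t in vocab:
        total = sum(df[c][t] for c in classes)  # docs containing t, all classes
        scores[t] = sum(
            (df[c][t] / docs_per_class[c]) ** 2 * (df[c][t] / total) ** 2
            for c in classes
        )
    return scores

def select_top_k(scores, k):
    """Keep the k highest-scoring terms; ties are broken alphabetically."""
    return set(sorted(scores, key=lambda t: (-scores[t], t))[:k])

def train_mnb(docs, labels, vocab, alpha=1.0):
    """Multinomial Naive Bayes with Laplace smoothing, restricted to `vocab`."""
    classes = sorted(set(labels))
    log_prior = {c: math.log(labels.count(c) / len(labels)) for c in classes}
    counts = {c: Counter() for c in classes}
    for tokens, c in zip(docs, labels):
        counts[c].update(t for t in tokens if t in vocab)
    log_lik = {}
    for c in classes:
        denom = sum(counts[c].values()) + alpha * len(vocab)
        log_lik[c] = {t: math.log((counts[c][t] + alpha) / denom) for t in vocab}
    return log_prior, log_lik

def predict(model, tokens):
    """Return the class with the highest log posterior; unselected terms are ignored."""
    log_prior, log_lik = model
    def log_posterior(c):
        return log_prior[c] + sum(log_lik[c][t] for t in tokens if t in log_lik[c])
    return max(log_prior, key=log_posterior)

# Toy data (hypothetical, not the IMDB dataset used in the study).
docs = [
    "a great and moving film".split(),
    "great acting and a great story".split(),
    "a dull and boring film".split(),
    "boring plot and dull acting".split(),
]
labels = ["pos", "pos", "neg", "neg"]

scores = gini_text_scores(docs, labels)
selected = select_top_k(scores, 3)
model = train_mnb(docs, labels, selected)
print(sorted(selected))                         # ['boring', 'dull', 'great']
print(predict(model, "a great story".split()))  # pos
```

On this toy data the terms that appear in every document of one class and in no document of the other receive the maximum score of 1.0, while terms shared by both classes, such as "and", score lower and are dropped. A 90%/10% train/test split, as in the study, would be applied before scoring so that the selection uses training documents only.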

Keywords:
Feature selection; Artificial intelligence; Computer science; Naive Bayes classifier; Classifier (UML); Sentence; Feature (linguistics); Machine learning; Sentiment analysis; Selection (genetic algorithm); Index (typography); Data mining; Pattern recognition (psychology); Natural language processing; Support vector machine

Metrics

Cited By: 9
FWCI (Field Weighted Citation Impact): 1.82
Refs: 0
Citation Normalized Percentile: 0.89

Topics

Data Mining and Machine Learning Applications
Physical Sciences →  Computer Science →  Information Systems
Multimedia Learning Systems
Physical Sciences →  Computer Science →  Information Systems
Educational Technology Systems
Physical Sciences →  Computer Science →  Artificial Intelligence