JOURNAL ARTICLE

Sentiment Analysis Using Hybrid Feature Selection Techniques

Abstract

People around the world use social media and social networks to express their feelings about a wide range of topics. One of the most popular platforms is Twitter, a microblogging website where users publicly share their views and feelings about products, services, events, and more. This makes Twitter one of the most valuable sources of data for researchers and developers who want to reveal people's sentiment about topics such as the products and services of commercial companies and well-known people such as politicians and athletes, by classifying that sentiment as positive or negative. Sentiment classification can be automated with machine learning algorithms and improved with appropriate feature selection methods. We collected the most recent tweets about four topics (Amazon, Trump, Chelsea FC, and CR7) using the Twitter application programming interface (API) and assigned sentiment scores using a lexicon rule-based approach. We then propose a machine learning model that improves classification accuracy through a hybrid feature selection method, which combines the filter-based Chi-square (Chi-2) method with the wrapper-based binary coordinate ascent (BCA) method (Chi-2 + BCA). The hybrid method selects an optimal subset of features from term frequency-inverse document frequency (TF-IDF) features for a support vector machine (SVM) classifier, and from bag-of-words features for a logistic regression (LR) classifier, using different n-gram ranges. Compared with Chi-2 selection alone and with the classifiers without feature subset selection, the hybrid feature selection method increases classification accuracy in all cases. The maximum accuracy attained is 86.55% with LR using the (1 + 2 + 3)-gram range and 85.575% with SVM using the unigram range, both on the CR7 dataset.
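
The two-stage idea in the abstract (a Chi-2 filter followed by a BCA wrapper) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the toy data, the function names (`chi2_score`, `chi2_filter`, `binary_coordinate_ascent`, `toy_accuracy`), the empty starting subset, and the "any selected term present" rule that stands in for SVM or LR accuracy are all assumptions made for illustration.

```python
from typing import Callable, List, Sequence, Tuple


def chi2_score(present: Sequence[int], labels: Sequence[int]) -> float:
    """Chi-square statistic of the 2x2 table: term presence vs. binary label."""
    n = len(labels)
    a = sum(1 for p, y in zip(present, labels) if p and y)      # present, positive
    b = sum(1 for p, y in zip(present, labels) if p and not y)  # present, negative
    c = sum(1 for p, y in zip(present, labels) if not p and y)  # absent, positive
    d = n - a - b - c                                           # absent, negative
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return 0.0 if denom == 0 else n * (a * d - b * c) ** 2 / denom


def chi2_filter(X: List[List[int]], y: List[int], k: int) -> List[int]:
    """Filter stage: indices of the k features with the highest Chi-2 score."""
    scores = [chi2_score([row[j] for row in X], y) for j in range(len(X[0]))]
    ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:k])


def binary_coordinate_ascent(
    n_features: int,
    objective: Callable[[List[int]], float],
    max_sweeps: int = 10,
) -> Tuple[List[int], float]:
    """Wrapper stage: flip one bit at a time and keep flips that raise the objective."""
    mask = [0] * n_features          # assumption: start from the empty subset
    best = objective(mask)
    for _ in range(max_sweeps):
        improved = False
        for j in range(n_features):
            mask[j] ^= 1             # tentatively add or remove feature j
            score = objective(mask)
            if score > best:
                best, improved = score, True
            else:
                mask[j] ^= 1         # undo the flip
        if not improved:             # a full sweep with no gain: stop
            break
    return mask, best


# Hypothetical toy data: binary presence of 4 terms in 6 labelled tweets.
X = [[1, 1, 0, 0], [0, 1, 0, 1], [1, 0, 0, 1],
     [0, 1, 1, 0], [0, 0, 1, 0], [0, 1, 0, 0]]
y = [1, 1, 1, 0, 0, 0]

kept = chi2_filter(X, y, k=3)        # filter stage keeps 3 of the 4 terms


def toy_accuracy(mask: List[int]) -> float:
    """Stand-in for classifier accuracy: predict positive if any selected term occurs."""
    chosen = [kept[i] for i, bit in enumerate(mask) if bit]
    preds = [1 if any(row[j] for j in chosen) else 0 for row in X]
    return sum(int(p == t) for p, t in zip(preds, y)) / len(y)


mask, best = binary_coordinate_ascent(len(kept), toy_accuracy)
selected = [kept[i] for i, bit in enumerate(mask) if bit]
print(kept, selected, best)
```

In a realistic setting the objective passed to the wrapper would be the validation accuracy of the chosen classifier on the masked feature matrix, which is far more expensive to evaluate; running the filter first shrinks the search space the wrapper has to sweep.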

Keywords:
Feature selection; Sentiment analysis; Computer science; Support vector machine; Artificial intelligence; Machine learning; Microblogging; Social media; Feature (linguistics); Selection (genetic algorithm); Lexicon; tf–idf; Filter (signal processing); Feature vector; Data mining; World Wide Web; Term (time)

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.15
Refs: 24
Citation Normalized Percentile: 0.52

Topics

Sentiment Analysis and Opinion Mining
Physical Sciences → Computer Science → Artificial Intelligence
Spam and Phishing Detection
Physical Sciences → Computer Science → Information Systems
Text and Document Classification Technologies
Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Efficient feature selection techniques for sentiment analysis

Avinash Madasu, E. Sivasankar

Journal: Multimedia Tools and Applications, Year: 2019, Vol: 79 (9-10), Pages: 6313-6335

JOURNAL ARTICLE

Sentiment Reviews Classification using Hybrid Feature Selection

K. Selva Bhuvaneswari, R. Parimala

Journal: International Journal of Database Theory and Application, Year: 2017, Vol: 10 (7), Pages: 1-12

JOURNAL ARTICLE

Feature selection for sentiment analysis using hybrid multiobjective evolutionary algorithm

Rimsha Gul, Maryam Bashir

Journal: Journal of Intelligent & Fuzzy Systems, Year: 2024, Vol: 46 (4), Pages: 8917-8932

JOURNAL ARTICLE

Sentiment classification using hybrid feature selection and ensemble classifier

Achin Jain, Vanita Jain

Journal: Journal of Intelligent & Fuzzy Systems, Year: 2021, Vol: 42 (2), Pages: 659-668