JOURNAL ARTICLE

Sentiment Classification on Multivariate Feature Selection on Social Media dataset using Hybrid Machine Learning Techniques

Sudeep K. Hase

Year: 2024 Journal:   Journal of Information Systems Engineering & Management Vol: 10 (1s)Pages: 525-539   Publisher: Lectito Journals

Abstract

Sentiment classification is a crucial component of natural language processing that focuses on analyzing and classifying the emotional tone conveyed in text data. With the rapid proliferation of social media platforms, the ability to accurately discern public sentiment has become vital for applications spanning marketing, political forecasting, and public opinion analysis. This abstract delves into the implementation of hybrid machine learning techniques for sentiment classification, leveraging multivariate feature selection methods on diverse social media datasets. Traditional machine learning models, though effective, often struggle with the complexity and high dimensionality of social media data, which may include text, emojis, images, and metadata. A hybrid machine learning approach, combining the strengths of various models, addresses these challenges by optimizing both feature selection and classification accuracy. The proposed framework begins with robust data preprocessing, including text normalization and tokenization. Advanced feature extraction methods such as Term Frequency-Inverse Document Frequency (TF-IDF), word embeddings (Word2Vec, GloVe), and sentiment lexicons are utilized to capture the intricate semantic characteristics of the text. For multivariate feature selection, techniques such as Recursive Feature Elimination (RFE), Chi-square tests, and correlation-based feature selection (CFS) are employed to identify and retain the most informative features, thereby improving model efficiency. The classification stage integrates hybrid models, combining the predictive power of algorithms such as Support Vector Machines (SVM), Random Forests, and ensemble learning methods (e.g., gradient boosting). These models are tuned using cross-validation and grid search to enhance generalization performance. The hybrid approach demonstrates superior performance in terms of accuracy, precision, recall, and F1-score compared to standalone machine learning models. The combination of comprehensive feature selection and robust classification algorithms effectively mitigates overfitting and enhances scalability. Empirical results from experiments on real-world social media datasets indicate that the proposed method is adept at capturing nuanced sentiment variations and ensuring high classification accuracy, proving its effectiveness for dynamic and large-scale data analysis.

Keywords:
Feature selection Computer science Artificial intelligence Multivariate statistics Selection (genetic algorithm) Machine learning Social media Feature (linguistics) Sentiment analysis Pattern recognition (psychology) World Wide Web

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.26
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Spam and Phishing Detection
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Hybrid Feature Selection on Social Media Dataset for Sentiment Classification using Deep Learning Techniques

Rashmi Soni Sudeep K. Hase

Journal:   Communications on Applied Nonlinear Analysis Year: 2025 Vol: 32 (9s)Pages: 1899-1918
JOURNAL ARTICLE

Hybrid Ensemble Learning With Feature Selection for Sentiment Classification in Social Media

Sanur SharmaAnurag Jain

Journal:   International Journal of Information Retrieval Research Year: 2020 Vol: 10 (2)Pages: 40-58
JOURNAL ARTICLE

Sentiment Reviews Classification using Hybrid Feature Selection

K. Selva BhuvaneswariR. Parimala

Journal:   International Journal of Database Theory and Application Year: 2017 Vol: 10 (7)Pages: 1-12
© 2026 ScienceGate Book Chapters — All rights reserved.