JOURNAL ARTICLE

Performance Enhancement of the Unbalanced Text Classification Problem Through a Modified Chi Square-Based Feature Selection Technique

Santosh Kumar BeheraRajashree Dash

Year: 2022 Journal:   International Journal of Intelligent Information Technologies Vol: 18 (1)Pages: 1-23   Publisher: IGI Global

Abstract

This paper proposes a modified chi square-based feature selection algorithm in conjunction with a random vector functional link network-based text classifier for improving the classification performance of multi-labeled text documents with unbalanced class distributions. In the proposed feature selection method, maximum features are selected from classes that have a great deal of training and testing documents as an improvement towards original chi-square method. On two benchmark datasets that are multi-labeled, multi-class, and unbalanced, a comparison of the model with three conventional selection techniques such as chi-square, term frequency-inverse document frequency, and mutual information is accumulated for assessing its effectiveness. Additionally, the proposed model is compared with four different classifiers. In the study, it was found that the proposed model performs better in terms of precision, recall, f-measure, and hamming losses and is able to select the majority of true positive documents despite an unbalanced class distribution for both the datasets.

Keywords:
Computer science Feature selection Artificial intelligence Classifier (UML) Pattern recognition (psychology) Benchmark (surveying) Selection (genetic algorithm) Class (philosophy) Machine learning Data mining

Metrics

2
Cited By
0.39
FWCI (Field Weighted Citation Impact)
27
Refs
0.62
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

JOURNAL ARTICLE

A novel feature selection technique for enhancing performance of unbalanced text classification problem

Santosh Kumar BeheraRajashree Dash

Journal:   Intelligent Decision Technologies Year: 2022 Vol: 16 (1)Pages: 51-69
JOURNAL ARTICLE

Effective feature selection technique for text classification

Hari SeethaM. Narasimha MurtyR. Saravanan

Journal:   International Journal of Data Mining Modelling and Management Year: 2015 Vol: 7 (3)Pages: 165-165
BOOK-CHAPTER

Modified Pointwise Mutual Information-Based Feature Selection for Text Classification

Tsvetanka Georgieva‐Trifonova

Lecture notes in networks and systems Year: 2021 Pages: 333-353
JOURNAL ARTICLE

Unbalanced Data Classification using Feature Selection through BitApriori Algorithm.

Pratik A. BarotHarikrishna B. Jethva

Journal:   International Journal of Computer Sciences and Engineering Year: 2018 Vol: 6 (10)Pages: 701-704
© 2026 ScienceGate Book Chapters — All rights reserved.