JOURNAL ARTICLE

Feature Selection Method from Multiclass Text with Class Imbalance Problem

Minji Seo, Gilseung Ahn, Sun Hur

Year: 2019 Journal: Journal of Korean Institute of Industrial Engineers Vol: 45 (2) Pages: 93-100

Abstract

A text classification model trained on data whose class variable is skewed toward a majority class typically assigns most documents to that class in order to maximize overall classification accuracy. This is known as the class imbalance problem. This study proposes a feature selection method, based on simplified chi-square statistics, that selects features within each class in order to build a model that is robust to the problem. The proposed method is compared with typical feature selection methods on the Reuters-21578 data set. The experiments show that the proposed method outperforms typical feature selection methods when used with naïve Bayes and support vector machine classifiers, which are robust to the class imbalance problem.
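The idea of selecting features per class with a simplified chi-square score can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not give the formula, so the sketch assumes one commonly cited simplified variant, P(t,c)P(¬t,¬c) − P(t,¬c)P(¬t,c), and the function names `simplified_chi_square` and `select_top_k_per_class` are hypothetical.

```python
from collections import Counter


def simplified_chi_square(docs, labels):
    """Score every (class, term) pair.

    Assumed score (not confirmed by the abstract):
        P(t, c) * P(~t, ~c) - P(t, ~c) * P(~t, c)
    docs   : list of token collections, one per document
    labels : list of class labels, aligned with docs
    """
    docs = [set(d) for d in docs]          # presence/absence only
    n = len(docs)
    term_df = Counter(t for d in docs for t in d)   # docs containing t
    class_size = Counter(labels)                     # docs per class
    joint = {c: Counter() for c in class_size}       # docs in c containing t
    for d, y in zip(docs, labels):
        joint[y].update(d)

    scores = {}
    for c, n_c in class_size.items():
        scores[c] = {}
        for t, df in term_df.items():
            a = joint[c][t]            # t present, class c
            b = df - a                 # t present, other classes
            c_absent = n_c - a         # t absent, class c
            d_absent = n - a - b - c_absent  # t absent, other classes
            scores[c][t] = (a * d_absent - b * c_absent) / (n * n)
    return scores


def select_top_k_per_class(scores, k):
    """Keep the k best-scoring terms of each class (ties broken alphabetically)."""
    return {
        c: [t for t, _ in sorted(s.items(), key=lambda kv: (-kv[1], kv[0]))[:k]]
        for c, s in scores.items()
    }


if __name__ == "__main__":
    # Toy, made-up corpus: "crude" is the majority class.
    docs = [
        {"oil", "price"}, {"oil", "barrel"}, {"oil", "export"},
        {"wheat", "export"}, {"gold", "price"},
    ]
    labels = ["crude", "crude", "crude", "grain", "gold"]
    print(select_top_k_per_class(simplified_chi_square(docs, labels), k=1))
```

Because the top k terms are taken separately for each class, minority classes are guaranteed their own features even when a single global ranking would be dominated by terms from the majority class.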

Keywords:
Feature selection, Class (philosophy), Feature (linguistics), Artificial intelligence, Computer science, Multiclass classification, Pattern recognition (psychology), Support vector machine, Selection (genetic algorithm), Naive Bayes classifier, Machine learning, Data mining

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0
Citation Normalized Percentile: 0.05

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Improved Feature-Selection Method Considering the Imbalance Problem in Text Categorization

Jieming Yang, Zhaoyang Qu, Zhiying Liu

Journal: The Scientific World Journal Year: 2014 Vol: 2014 Pages: 1-17

JOURNAL ARTICLE

Assessing feature selection method performance with class imbalance data

Surani Matharaarachchi, Michael Domaratzki, Saman Muthukumarana

Journal: Machine Learning with Applications Year: 2021 Vol: 6 Pages: 100170-100170

BOOK-CHAPTER

Cost-Sensitive Feature Selection for Class Imbalance Problem

Małgorzata Bach, Aleksandra Werner

Advances in intelligent systems and computing Year: 2017 Pages: 182-194

JOURNAL ARTICLE

ENSEMBLE META CLASSIFIER WITH SAMPLING AND FEATURE SELECTION FOR DATA WITH IMBALANCE MULTICLASS PROBLEM

Mohd Shamrie Sainin, Rayner Alfred, Faudziah Ahmad

Journal: Journal of Information and Communication Technology Year: 2021 Vol: 20

JOURNAL ARTICLE

Combating the Small Sample Class Imbalance Problem Using Feature Selection

Mike Wasikowski, Xuewen Chen

Journal: IEEE Transactions on Knowledge and Data Engineering Year: 2009 Vol: 22 (10) Pages: 1388-1400