JOURNAL ARTICLE

Feature Selection Method Based on Weighted Mutual Information for Imbalanced Data

Kewen LiMingxiao YuLu LiuTiming LiJiannan Zhai

Year: 2018 Journal:   International Journal of Software Engineering and Knowledge Engineering Vol: 28 (08)Pages: 1177-1194   Publisher: World Scientific

Abstract

The class imbalance problem has negative effects on the performance of feature selection in imbalanced data. Traditional feature selection algorithms always study on the balanced class distribution of the data and improve the overall classification accuracy for the optimization goal, which tends to be overwhelmed by the large classes, ignoring the small ones. This paper proposes a novel feature selection method based on the weighted mutual information (WMI) for the imbalanced data, defined as WMI algorithm. The WMI algorithm assigns different weights to the samples based on the fuzzy c-means (FCM) clustering algorithm and then calculates the mutual information based on the weight of each sample. This paper used the AUC as the evaluation criterion of the selected feature. At last, four unbalanced datasets from NASA software defect datasets are used to validate the proposed approach. Experimental results show that the proposed method achieves higher prediction accuracy of both minority class and majority class.

Keywords:
Feature selection Mutual information Data mining Computer science Feature (linguistics) Cluster analysis Class (philosophy) Artificial intelligence Selection (genetic algorithm) Pattern recognition (psychology) Fuzzy logic Machine learning

Metrics

15
Cited By
1.39
FWCI (Field Weighted Citation Impact)
30
Refs
0.84
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Feature selection based on weighted conditional mutual information

Hongfang ZhouXiqian WangYao Zhang

Journal:   Applied Computing and Informatics Year: 2020 Vol: 20 (1/2)Pages: 55-68
BOOK-CHAPTER

Weighted Mutual Information for Feature Selection

Erik SchaffernichtHorst–Michael Groß

Lecture notes in computer science Year: 2011 Pages: 181-188
JOURNAL ARTICLE

Feature selection based on mutual information for gear imbalanced problem faulty diagnosis

T.Y. Liu

Journal:   2012 International Conference on System Simulation (ICUSS 2012) Year: 2012 Pages: 54-54
© 2026 ScienceGate Book Chapters — All rights reserved.