JOURNAL ARTICLE

An Ensemble Framework of Multi-ratio Undersampling-based Imbalanced Classification

Takahiro KomamizuYasuhiro OgawaKatsuhiko Toyama

Year: 2021 Journal:   Journal of Data Intelligence Vol: 2 (1)Pages: 30-46   Publisher: Rinton Press

Abstract

Class imbalance is commonly observed in real-world data, and it is problematic in that it degrades classification performance due to biased supervision. Undersampling is an effective resampling approach to the class imbalance. The conventional undersampling-based approaches involve a single fixed sampling ratio. However, different sampling ratios have different preferences toward classes. In this paper, an undersampling-based ensemble framework, MUEnsemble, is proposed. This framework involves weak classifiers of different sampling ratios, and it allows for a flexible design for weighting weak classifiers in different sampling ratios. To demonstrate the principle of the design, in this paper, a uniform weighting function and a Gaussian weighting function are presented. An extensive experimental evaluation shows that MUEnsemble outperforms undersampling-based and oversampling-based state-of-the-art methods in terms of recall, gmean, F-measure, and ROC-AUC metrics. Also, the evaluation showcases that the Gaussian weighting function is superior to the uniform weighting function. This indicates that the Gaussian weighting function can capture the different preferences of sampling ratios toward classes. An investigation into the effects of the parameters of the Gaussian weighting function shows that the parameters of this function can be chosen in terms of recall, which is preferred in many real-world applications.

Keywords:
Undersampling Weighting Oversampling Gaussian Computer science Function (biology) Resampling Sampling (signal processing) Gaussian function Artificial intelligence Pattern recognition (psychology) Machine learning Statistics Mathematics

Metrics

2
Cited By
0.28
FWCI (Field Weighted Citation Impact)
27
Refs
0.61
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning and Data Classification
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Hashing-Based Undersampling Ensemble for Imbalanced Pattern Classification Problems

Wing W. Y. NgShichao XuJianjun ZhangXing TianTongwen RongSam Kwong

Journal:   IEEE Transactions on Cybernetics Year: 2020 Vol: 52 (2)Pages: 1269-1279
JOURNAL ARTICLE

Combining Multi-ratio Undersampling and Metric Learning for Imbalanced Classification

Takahiro Komamizu

Journal:   Journal of Data Intelligence Year: 2021 Vol: 2 (4)Pages: 462-475
JOURNAL ARTICLE

K Means Cluster Based Undersampling Ensemble for Imbalanced Data Classification

S. Santha SubbulaxmiG. Arumugam

Journal:   International Journal of Engineering and Advanced Technology Year: 2020 Vol: 9 (3)Pages: 2074-2079
© 2026 ScienceGate Book Chapters — All rights reserved.