JOURNAL ARTICLE

Class Imbalance Ensemble Learning Based on the Margin Theory

Wei Feng, Wenjiang Huang, Jinchang Ren

Year: 2018   Journal: Applied Sciences   Vol: 8 (5)   Pages: 815-815   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

The proportion of instances belonging to each class in a data-set plays an important role in machine learning. However, real-world data often suffer from class imbalance. Dealing with multi-class tasks in which classes have different misclassification costs is harder than dealing with two-class ones. Undersampling and oversampling are two of the most popular data preprocessing techniques for imbalanced data-sets. Ensemble classifiers have been shown to be more effective than data sampling techniques at improving classification performance on imbalanced data. Moreover, combining ensemble learning with sampling methods to tackle the class imbalance problem has led to several proposals in the literature, with positive results. The ensemble margin is a fundamental concept in ensemble learning. Several studies have shown that the generalization performance of an ensemble classifier is related to the distribution of its margins on the training examples. In this paper, we propose a novel ensemble-margin-based algorithm, which handles imbalanced classification by employing more low-margin examples, which are more informative than high-margin samples. This algorithm combines ensemble learning with undersampling, but instead of balancing classes randomly as UnderBagging does, our method focuses on constructing higher-quality balanced sets for each base classifier. To demonstrate the effectiveness of the proposed method in handling class-imbalanced data, UnderBagging and SMOTEBagging are used in a comparative analysis. In addition, we compare the performance of different ensemble margin definitions, including both supervised and unsupervised margins, in class imbalance learning.
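
The idea of favouring low-margin examples during undersampling can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the function names, the toy vote matrix, and the selection rule (keep the lowest-margin examples of each class, down to the size of the smallest class) are assumptions made for illustration. The two margin functions follow the commonly used max-margin definitions, one using the true label (supervised) and one using only the votes (unsupervised).

```python
from collections import Counter

def supervised_margin(votes, true_label):
    """(votes for the true class - most votes for any other class) / total votes, in [-1, 1]."""
    counts = Counter(votes)
    v_true = counts.get(true_label, 0)
    v_other = max((c for lab, c in counts.items() if lab != true_label), default=0)
    return (v_true - v_other) / len(votes)

def unsupervised_margin(votes):
    """(top vote count - runner-up vote count) / total votes, in [0, 1]; ignores the true label."""
    ranked = sorted(Counter(votes).values(), reverse=True)
    second = ranked[1] if len(ranked) > 1 else 0
    return (ranked[0] - second) / len(votes)

def low_margin_balanced_indices(vote_matrix, labels):
    """Hypothetical selection rule: shrink every class to the size of the smallest
    class, keeping the examples with the lowest supervised margin."""
    margins = [supervised_margin(v, y) for v, y in zip(vote_matrix, labels)]
    by_class = {}
    for i, y in enumerate(labels):
        by_class.setdefault(y, []).append(i)
    target = min(len(idx) for idx in by_class.values())
    keep = []
    for idx in by_class.values():
        # Ascending sort puts the hardest (lowest-margin) examples first.
        keep.extend(sorted(idx, key=lambda i: margins[i])[:target])
    return sorted(keep)

# Toy data: each row holds the class votes of four base classifiers for one example.
votes = [
    ["a", "a", "a", "a"],  # true class a, margin  1.0
    ["a", "a", "b", "b"],  # true class a, margin  0.0
    ["a", "b", "b", "b"],  # true class a, margin -0.5
    ["a", "a", "a", "b"],  # true class a, margin  0.5
    ["b", "b", "a", "a"],  # true class b, margin  0.0
    ["b", "b", "b", "b"],  # true class b, margin  1.0
]
labels = ["a", "a", "a", "a", "b", "b"]
selected = low_margin_balanced_indices(votes, labels)
```

In this toy case the majority class `a` is reduced to its two lowest-margin examples, while both examples of the minority class `b` are kept. A random undersampler such as UnderBagging would instead pick the two `a` examples uniformly at random.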

Keywords:
Undersampling, Oversampling, Machine learning, Artificial intelligence, Computer science, Ensemble learning, Margin (machine learning), Classifier (UML), Preprocessor, Data mining, Pattern recognition (psychology)

Metrics

Cited By: 141
FWCI (Field Weighted Citation Impact): 15.49
Refs: 85
Citation Normalized Percentile: 0.99
Is in top 1%
Is in top 10%

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Electricity Theft Detection Techniques
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Financial Distress and Bankruptcy Prediction
Social Sciences →  Business, Management and Accounting →  Accounting

Related Documents

JOURNAL ARTICLE

Ensemble-based active learning for class imbalance problem

Yanping Yang, Guangzhi Ma

Journal: Journal of Biomedical Science and Engineering   Year: 2010   Vol: 03 (10)   Pages: 1022-1029

JOURNAL ARTICLE

Imputation-based Ensemble Techniques for Class Imbalance Learning

Roozbeh Razavi-Far, Maryam Farajzadeh-Zanjani, Boyu Wang, Mehrdad Saif, Shiladitya Chakrabarti

Journal: IEEE Transactions on Knowledge and Data Engineering   Year: 2019   Pages: 1-1

JOURNAL ARTICLE

Resampling-Based Ensemble Methods for Online Class Imbalance Learning

Shuo Wang, Leandro L. Minku, Xin Yao

Journal: IEEE Transactions on Knowledge and Data Engineering   Year: 2014   Vol: 27 (5)   Pages: 1356-1368

JOURNAL ARTICLE

Unsupervised Ensemble Learning for Class Imbalance Problems

Zihan Liu, Dongrui Wu

Year: 2018   Vol: 2   Pages: 3593-3600