JOURNAL ARTICLE

IRS-BAG-Integrated Radius-SMOTE Algorithm with Bagging Ensemble Learning Model for Imbalanced Data Set Classification

Abstract

Imbalanced learning problems are a challenge faced by classifiers when data samples have an unbalanced distribution among classes. The Synthetic Minority Over-Sampling Technique (SMOTE) is one of the most well-known data pre-processing methods. Problems that arise when oversampling with SMOTE are the phenomenon of noise, small disjunct samples, and overfitting due to a high imbalance ratio in a dataset. A high level of imbalance ratio and low variance conditions cause the results of synthetic data generation to be collected in narrow areas and conflicting regions among classes and make them susceptible to overfitting during the learning process by machine learning methods. Therefore, this research proposes a combination between Radius-SMOTE and Bagging Algorithm called the IRS-BAG Model. For each sub-sample generated by bootstrapping, oversampling was done using Radius SMOTE. Oversampling on the sub-sample was likely to overcome overfitting problems that might occur. Experiments were carried out by comparing the performance of the IRS-BAG model with various previous oversampling methods using the imbalanced public dataset. The experiment results using three different classifiers proved that all classifiers had gained a notable improvement when combined with the proposed IRS-BAG model compared with the previous state-of-the-art oversampling methods. Doi: 10.28991/ESJ-2023-07-05-04 Full Text: PDF

Keywords:
Oversampling Overfitting Bootstrapping (finance) Artificial intelligence Machine learning Computer science Sample (material) Algorithm Set (abstract data type) Sampling (signal processing) Data set Data mining Pattern recognition (psychology) Mathematics Artificial neural network Detector Bandwidth (computing)

Metrics

9
Cited By
2.30
FWCI (Field Weighted Citation Impact)
32
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Artificial Intelligence in Healthcare
Health Sciences →  Health Professions →  Health Information Management
Currency Recognition and Detection
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

BOOK-CHAPTER

Bagging-Based Ensemble Learning for Imbalanced Data Classification Problem

M. Govindarajan

Auerbach Publications eBooks Year: 2023 Pages: 57-64
JOURNAL ARTICLE

A Novel Ensemble Method for Imbalanced Data Learning: Bagging of Extrapolation-SMOTE SVM

Qi WangZhihao LuoJincai HuangYanghe FengZhong Liu

Journal:   Computational Intelligence and Neuroscience Year: 2017 Vol: 2017 Pages: 1-11
BOOK-CHAPTER

Imbalanced Data Classification Algorithm Based on Integrated Sampling and Ensemble Learning

Han YanMing-Xiang HeQixian Lu

Advances in intelligent systems and computing Year: 2019 Pages: 615-624
JOURNAL ARTICLE

A stacking ensemble model with SMOTE for improved imbalanced classification on credit data

Nur AlamsyahBudiman BudimanTitan Parama YogaR. Yadi Rakhman Alamsyah

Journal:   TELKOMNIKA (Telecommunication Computing Electronics and Control) Year: 2024 Vol: 22 (3)Pages: 657-657
© 2026 ScienceGate Book Chapters — All rights reserved.