JOURNAL ARTICLE

Clustering-based subset ensemble learning method for imbalanced data

Abstract

In recent research, classification involving imbalanced datasets has received considerable attention. Most classification algorithms tend to predict that most of the incoming data belongs to the majority class, resulting in the poor classification performance in minority class instances, which are usually of much more interest. In this paper we propose a clustering-based subset ensemble learning method for handling class imbalanced problem. In the proposed approach, first, new balanced training datasets are produced using clustering-based under-sampling, then, further classification of new training sets are performed by applying four algorithms: Decision Tree, Naïve Bayes, KNN and SVM, as the base algorithms in combined-bagging. An experimental analysis is carried out over a wide range of highly imbalanced data sets. The results obtained show that our method can improve imbalance classification performance of rare and normal classes stably and effectively.

Keywords:
Cluster analysis Computer science Artificial intelligence Ensemble learning Decision tree Machine learning Naive Bayes classifier Support vector machine Statistical classification Data mining Class (philosophy) Pattern recognition (psychology) Range (aeronautics)

Metrics

10
Cited By
0.00
FWCI (Field Weighted Citation Impact)
15
Refs
0.10
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Electricity Theft Detection Techniques
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Rough Sets and Fuzzy Logic
Physical Sciences →  Computer Science →  Computational Theory and Mathematics

Related Documents

BOOK-CHAPTER

Imbalanced Data Classification Method Based on Ensemble Learning

Yu XiangYongping Xie

Lecture notes in electrical engineering Year: 2019 Pages: 18-24
JOURNAL ARTICLE

Ensemble learning method based on CNN for class imbalanced data

Xin ZhongNan Wang

Journal:   The Journal of Supercomputing Year: 2023 Vol: 80 (7)Pages: 10090-10121
BOOK-CHAPTER

Clustering-Based Ensemble Pruning in the Imbalanced Data Classification

Paweł Zyblewski

Lecture notes in computer science Year: 2021 Pages: 156-171
JOURNAL ARTICLE

KDE-Based Ensemble Learning for Imbalanced Data

Firuz KamalovSherif MoussaJorge Avante Reyes

Journal:   Electronics Year: 2022 Vol: 11 (17)Pages: 2703-2703
JOURNAL ARTICLE

Spark-based Ensemble Learning for Imbalanced Data Classification

Jiaman Ding

Journal:   International Journal of Performability Engineering Year: 2018
© 2026 ScienceGate Book Chapters — All rights reserved.