JOURNAL ARTICLE

K Means Cluster Based Undersampling Ensemble for Imbalanced Data Classification

S. Santha SubbulaxmiG. Arumugam

Year: 2020 Journal:   International Journal of Engineering and Advanced Technology Vol: 9 (3)Pages: 2074-2079

Abstract

Imbalanced data classification is a critical and challenging problem in both data mining and machine learning. Imbalanced data classification problems present in many application areas like rare medical diagnosis, risk management, fault-detection, etc. The traditional classification algorithms yield poor results in imbalanced classification problems. In this paper, K-Means cluster based undersampling ensemble algorithm is proposed to solve the imbalanced data classification problem. The proposed method combines K-Means cluster based undersampling and boosting method. The experimental results show that the proposed algorithm outperforms the other sampling ensemble algorithms of previous studies.

Keywords:
Undersampling Boosting (machine learning) Computer science Data mining Statistical classification Artificial intelligence Data classification Ensemble learning Machine learning Cluster (spacecraft) Cluster analysis Pattern recognition (psychology)

Metrics

1
Cited By
0.00
FWCI (Field Weighted Citation Impact)
23
Refs
0.28
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Electricity Theft Detection Techniques
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Artificial Intelligence in Healthcare
Health Sciences →  Health Professions →  Health Information Management
© 2026 ScienceGate Book Chapters — All rights reserved.