JOURNAL ARTICLE

A Novel Clustering-Based Three Level Under-Sampling Algorithm for Class Imbalance Problem

Vibha PratapAmit Prakash Singh

Year: 2024 Journal:   DOAJ (DOAJ: Directory of Open Access Journals)

Abstract

The class imbalance is an important topic of research as imbalance exists in many applications where the presence of one type of sample is significantly greater than that of another type. To overcome binary class imbalance problems, a hybrid under-sampling approach based on k-mean clustering and pseudo-oversampling is proposed. Random Over-Sampling Examples (ROSE) aids in re-balancing an imbalanced dataset by creating minority samples using a smooth bootstrap method, and k-means clustering is used for better sample selection as each cluster contains examples having similar characteristics. It reduces the chance of elimination of useful majorityclass samples. For performance evaluation, 25 publicly available imbalanced datasets are collected from the KEEL repository. The proposed method improves classification results in terms of sensitivity, specificity, G-mean, F-measure, balance accuracy, and accuracy as compared to three state of art clustering-based undersampling methods SBC, KMUS, and OBU. The experimental results of this research can be used in the classification of various domains, such as medical diagnosis, banking fraud detection, anomaly detection, etc, which are generally imbalanced.

Keywords:
Undersampling Cluster analysis Class (philosophy) Pattern recognition (psychology) Statistical classification Cluster (spacecraft) Anomaly detection

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.49
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Cell Image Analysis Techniques
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Biophysics
Advanced Biosensing Techniques and Applications
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Artificial Immune Systems Applications
Physical Sciences →  Engineering →  Biomedical Engineering

Related Documents

JOURNAL ARTICLE

Adaptive K-means clustering based under-sampling methods to solve the class imbalance problem

Qian ZhouBo Sun

Journal:   Data and Information Management Year: 2023 Vol: 8 (3)Pages: 100064-100064
JOURNAL ARTICLE

Novel fuzzy clustering-based undersampling framework for class imbalance problem

Vibha PratapAmit Singh

Journal:   International Journal of Systems Assurance Engineering and Management Year: 2023 Vol: 14 (3)Pages: 967-976
JOURNAL ARTICLE

SOM-US: A Novel Under-Sampling Technique for Handling Class Imbalance Problem

Ajay Kumar

Journal:   Journal of Communications Software and Systems Year: 2024 Vol: 20 (1)Pages: 69-75
JOURNAL ARTICLE

A majority affiliation based under-sampling method for class imbalance problem

Ying XieX. HuangFeng QinFagen LiXuyang Ding

Journal:   Information Sciences Year: 2024 Vol: 662 Pages: 120263-120263
© 2026 ScienceGate Book Chapters — All rights reserved.