JOURNAL ARTICLE

Kernel-based SMOTE for SVM classification of imbalanced datasets

Abstract

Datasets with an imbalanced class distribution pose a severe challenge to traditional learning algorithms that are designed to improve overall classification accuracy. Preprocessing methods like Synthetic Minority Over-sampling Technique (SMOTE) address this problem by generating data points in the input space to balance the training dataset. However, such artificial sampling methods can distort the performance of Support Vector Machine (SVM) classifiers that operate in a kernel induced feature space. This paper proposes a kernel-based SMOTE (K-SMOTE) algorithm that directly generates synthetically minority data points in the feature space of SVM classifier. The new data points are added by augmenting the original Gram matrix based on neighbourhood information in the feature space. The proposed algorithm is statistically shown to improve performance on 51 benchmark datasets. K-SMOTE is further applied to predict the stage of degradation in a semiconductor etching chamber where it achieves a higher accuracy for the imbalanced faulty stages.

Keywords:
Support vector machine Computer science Artificial intelligence Preprocessor Pattern recognition (psychology) Kernel (algebra) Feature vector Classifier (UML) Machine learning Data mining Mathematics

Metrics

76
Cited By
3.77
FWCI (Field Weighted Citation Impact)
20
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Industrial Vision Systems and Defect Detection
Physical Sciences →  Engineering →  Industrial and Manufacturing Engineering
Non-Destructive Testing Techniques
Physical Sciences →  Engineering →  Mechanical Engineering

Related Documents

JOURNAL ARTICLE

Improving Classification of Imbalanced Datasets Based on KM++ SMOTE Algorithm

Li ChenPing DongWei SuYan Zhang

Journal:   2019 2nd International Conference on Safety Produce Informatization (IICSPI) Year: 2019 Pages: 300-306
BOOK-CHAPTER

SMOTE Based Weighted Kernel Extreme Learning Machine for Imbalanced Classification Problems

Roshani ChoudharySanyam Shukla

Advances in intelligent systems and computing Year: 2021 Pages: 193-200
JOURNAL ARTICLE

Gaussian Based-SMOTE Method for Handling Imbalanced Small Datasets

Muhammad MisdramEdi NoersasongkoPurwanto PurwantoMuljono MuljonoFandi Yulian Pamuji

Journal:   Jurnal Ilmiah Teknik Elektro Komputer dan Informatika Year: 2023 Vol: 9 (4)Pages: 973-982
© 2026 ScienceGate Book Chapters — All rights reserved.