JOURNAL ARTICLE

Imbalanced Learning Based on Data-Partition and SMOTE

Huaping Guo, Jun Zhou, Chang-an Wu

Year: 2018   Journal: Information   Vol: 9 (9)   Pages: 238-238   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Most conventional classification learning methods assume a relatively balanced class distribution and therefore perform poorly on data with an imbalanced class distribution. This paper proposes a novel classification method based on data-partition and SMOTE for imbalanced learning. The proposed method differs from conventional ones in both the learning and prediction stages. In the learning stage, the proposed method uses the following three steps to learn a class-imbalance-oriented model: (1) partitioning the majority class into several clusters using data-partition methods such as K-Means, (2) constructing a novel training set using SMOTE on each data set obtained by merging each cluster with the minority class, and (3) learning a classification model on each training set using conventional classification learning methods, including decision trees, SVMs and neural networks. The result is a classifier repository consisting of several classification models. In the prediction stage, for a given example to be classified, the proposed method uses the partition model constructed in the learning stage to select a model from the classifier repository, and that model predicts the example's class. Comprehensive experiments on KEEL data sets show that the proposed method outperforms some other existing methods on the evaluation measures of recall, g-mean, f-measure and AUC.
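
The two stages described in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation: the function names (`kmeans`, `smote`, `fit`, `predict`) are hypothetical, a small k-nearest-neighbour vote stands in for the decision tree, SVM or neural network base learners, and two choices are assumptions the abstract does not state, namely that SMOTE oversamples the minority class up to the size of each cluster and that the "partition model" routes an example to the model of its nearest cluster center.

```python
import math
import random


def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    return tuple(sum(c) / len(points) for c in zip(*points))


def nearest(centers, x):
    """Index of the center closest to x (Euclidean distance)."""
    return min(range(len(centers)), key=lambda i: math.dist(centers[i], x))


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's K-Means; returns the k cluster centers."""
    centers = random.Random(seed).sample(points, k)
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(centers, p)].append(p)
        # Keep the old center if a cluster happens to be empty.
        centers = [mean(g) if g else c for g, c in zip(groups, centers)]
    return centers


def smote(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    point and one of its k nearest minority neighbours (needs >= 2 points)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        x = rng.choice(minority)
        others = [m for m in minority if m is not x]
        neighbours = sorted(others, key=lambda m: math.dist(x, m))[:k]
        nb = rng.choice(neighbours)
        t = rng.random()
        synthetic.append(tuple(a + t * (b - a) for a, b in zip(x, nb)))
    return synthetic


def fit(majority, minority, n_clusters=2, seed=0):
    """Learning stage: one balanced training set per majority cluster."""
    centers = kmeans(majority, n_clusters, seed=seed)        # step (1)
    repository = []
    for i in range(n_clusters):
        cluster = [p for p in majority if nearest(centers, p) == i]
        # Assumption: oversample the minority class up to the cluster size.
        n_new = max(0, len(cluster) - len(minority))
        positives = minority + smote(minority, n_new, seed=seed + i)  # step (2)
        # Step (3): the stored training set is the k-NN "model" (stand-in learner).
        repository.append([(p, 0) for p in cluster] + [(p, 1) for p in positives])
    return centers, repository


def predict(model, x, k=3):
    """Prediction stage: route x to the model of its nearest cluster center,
    then take a majority vote among the k nearest training points."""
    centers, repository = model
    train = repository[nearest(centers, x)]
    labels = [y for _, y in sorted(train, key=lambda s: math.dist(s[0], x))[:k]]
    return max(set(labels), key=labels.count)


# Toy data (made up for illustration): two majority blobs, four minority points.
majority = [(x, y) for x in range(4) for y in range(4)]
majority += [(10 + x, 10 + y) for x in range(4) for y in range(4)]
minority = [(5, 5), (5, 6), (6, 5), (6, 6)]
model = fit(majority, minority)
```

Calling `predict(model, point)` returns 1 for the minority class and 0 for the majority class. Swapping the k-NN vote for a trained decision tree or SVM per training set would bring the sketch closer to the base learners named in the abstract.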

Keywords:
Computer science; Artificial intelligence; Machine learning; Partition (number theory); Classifier (UML); Decision tree; Support vector machine; Data mining; Artificial neural network; Multiclass classification; Pattern recognition (psychology); Mathematics

Metrics

Cited By: 24
FWCI (Field Weighted Citation Impact): 2.18
Refs: 60
Citation Normalized Percentile: 0.89

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Electricity Theft Detection Techniques
Physical Sciences →  Engineering →  Electrical and Electronic Engineering

Related Documents

JOURNAL ARTICLE

Surrounding neighborhood-based SMOTE for learning from imbalanced data sets

Vicente García, J. Salvador Sánchez, Raúl Martín-Félez, Ramón A. Mollineda

Journal: Progress in Artificial Intelligence   Year: 2012   Vol: 1 (4)   Pages: 347-362

JOURNAL ARTICLE

Imbalanced Classification Based on Active Learning SMOTE

Ying Mi

Journal: Research Journal of Applied Sciences Engineering and Technology   Year: 2013   Vol: 5 (3)   Pages: 944-949

JOURNAL ARTICLE

PDR-SMOTE: an imbalanced data processing method based on data region partition and K nearest neighbors

Hongfang Zhou, Zongling Wu, Ningning Xu, Hao Xiao

Journal: International Journal of Machine Learning and Cybernetics   Year: 2023   Vol: 14 (12)   Pages: 4135-4150