JOURNAL ARTICLE

DOSS: Dual Over Sampling Strategy for Imbalanced Data Classification

Abstract

Imbalanced datasets are often encountered in process monitoring, where the data reflecting abnormal events like machine failures is less than the data reflecting normal events. The former is called the minority class and the later is referred as the majority class. Classical machine learning algorithms are still facing challenges in solving this problem. In order to improve the classification accuracy, oversampling techniques rebalance the dataset by supplying the minority class with synthetic samples. However, the latent sample spaces of both classes are broad, the majority class might be under-represented as well. In this paper, we propose a dual oversampling strategy (DOSS) to generate samples for both classes. For the majority class, synthetic samples are generated according to the data distribution, which is approximated by conditional Generative Adversarial Network (cGAN). For the minority class, Synthetic Minority Over-sampling Technique (SMOTE) is applied as the oversampling method. The proposed strategy is compared with others that either only the minority class is oversampled or both classes are oversampled with different strategies. Recall, G-mean and F-measure are used as the metrics. The experimental results on 12 benchmark datasets show the improved performance of our proposed strategy. DOSS is further applied to detect the faulty stages of an injection moulding machine where the prediction of DOSS achieves a better accuracy.

Keywords:
Oversampling Artificial intelligence Benchmark (surveying) Machine learning Computer science Class (philosophy) Sampling (signal processing) Dual (grammatical number) Adversarial system Pattern recognition (psychology) Bandwidth (computing)

Metrics

2
Cited By
0.20
FWCI (Field Weighted Citation Impact)
13
Refs
0.61
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Electricity Theft Detection Techniques
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Industrial Vision Systems and Defect Detection
Physical Sciences →  Engineering →  Industrial and Manufacturing Engineering

Related Documents

JOURNAL ARTICLE

Borderline over-sampling for imbalanced data classification

Hien M. NguyenEric W. CooperKatsuari Kamei

Journal:   International Journal of Knowledge Engineering and Soft Data Paradigms Year: 2011 Vol: 3 (1)Pages: 4-4
JOURNAL ARTICLE

Over-sampling algorithm for imbalanced data classification

Xiaolong XuWen ChenYanfei Sun

Journal:   Journal of Systems Engineering and Electronics Year: 2019 Vol: 30 (6)Pages: 1182-1191
JOURNAL ARTICLE

DPC-SMOTE Over-sampling Algorithm for Imbalanced Data Classification

LIU ZhihanZHANG ZhonglinZHAO Lei

Journal:   DOAJ (DOAJ: Directory of Open Access Journals) Year: 2024
JOURNAL ARTICLE

Multiple adaptive over-sampling for imbalanced data evidential classification

Zhen ZhangHongpeng TianJin-shuai Jin

Journal:   Engineering Applications of Artificial Intelligence Year: 2024 Vol: 133 Pages: 108532-108532
© 2026 ScienceGate Book Chapters — All rights reserved.