JOURNAL ARTICLE

Adaptive Synthetic Oversampling Algorithm for Handling Class Imbalance in Multi-Class Data Stream Classification

S. Priya, Annie Uthra

Year: 2022 Journal: Journal of Computer Science Vol: 18 (7) Pages: 650-664 Publisher: Science Publications

Abstract

Concept drift and class imbalance are major challenges in modern streaming data classification. In particular, when combined with complicating factors such as noise and overlapping class distributions, concept drift and data imbalance can considerably degrade classifier results. In addition, various challenges affect the performance of existing oversampling schemes such as SMOTE and its derivatives. Moreover, several existing models concentrate on data imbalance in binary classification problems, whereas the more complex multi-class counterparts are yet to be explored. With this motivation, this study develops an Adaptive Synthetic Oversampling Algorithm (ASYNO) based Multiclass Streaming Data Classification (ASYNO-MCSDC) model for class imbalance handling and concept drift. The presented ASYNO-MCSDC method first performs several preprocessing stages: label encoding, data normalization, and data splitting. The ASYNO technique is then applied to handle the class imbalance problem. An online bagging ensemble classifier is employed for classification, with the Hoeffding Tree (HT) as the base classifier and the number of estimators set to 10. Two types of learning are used in the experiments: batch learning and incremental learning. The ASYNO-MCSDC model is validated on two datasets, namely a stationary imbalance stream and a dynamic imbalance stream. The experimental results indicate that the ASYNO-MCSDC model achieves promising results compared with other models.
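The online bagging step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the standard Poisson(1) resampling scheme commonly used for online bagging, and it swaps the Hoeffding Tree for a hypothetical running-centroid learner (`CentroidLearner`) so that the example stays self-contained. The ASYNO oversampling stage is omitted because the abstract does not specify it, and the synthetic three-class stream, its class weights, and all names below are assumptions made for illustration only. Only the ensemble size of 10 comes from the abstract.

```python
import math
import random
from collections import defaultdict


class CentroidLearner:
    """Hypothetical stand-in base learner (NOT a Hoeffding Tree).

    Keeps a running mean per class and predicts the nearest centroid.
    """

    def __init__(self):
        self.sums = {}                   # label -> per-feature running sums
        self.counts = defaultdict(int)   # label -> number of updates seen

    def learn_one(self, x, y):
        if y not in self.sums:
            self.sums[y] = [0.0] * len(x)
        self.sums[y] = [s + v for s, v in zip(self.sums[y], x)]
        self.counts[y] += 1

    def predict_one(self, x):
        if not self.sums:
            return None  # nothing learned yet

        def sq_dist(label):
            n = self.counts[label]
            return sum((v - s / n) ** 2 for v, s in zip(x, self.sums[label]))

        return min(self.sums, key=sq_dist)


def poisson1(rng):
    """Draw k ~ Poisson(1) using Knuth's multiplication method."""
    threshold, k, p = math.exp(-1.0), 0, rng.random()
    while p > threshold:
        k += 1
        p *= rng.random()
    return k


class OnlineBagging:
    """Online bagging: each member trains on each instance k ~ Poisson(1) times."""

    def __init__(self, n_estimators=10, seed=0):  # 10 estimators, as in the abstract
        self.rng = random.Random(seed)
        self.models = [CentroidLearner() for _ in range(n_estimators)]

    def learn_one(self, x, y):
        for model in self.models:
            for _ in range(poisson1(self.rng)):
                model.learn_one(x, y)

    def predict_one(self, x):
        votes = defaultdict(int)
        for model in self.models:
            label = model.predict_one(x)
            if label is not None:
                votes[label] += 1
        return max(votes, key=votes.get) if votes else None


def make_stream(n, seed=42):
    """Assumed toy stream: three well-separated classes with 80/15/5 imbalance."""
    rng = random.Random(seed)
    centers = {0: (0.0, 0.0), 1: (5.0, 5.0), 2: (10.0, 0.0)}
    for _ in range(n):
        y = rng.choices([0, 1, 2], weights=[0.80, 0.15, 0.05])[0]
        cx, cy = centers[y]
        yield [rng.gauss(cx, 0.5), rng.gauss(cy, 0.5)], y


# Incremental (test-then-train) evaluation: predict first, then learn.
ensemble = OnlineBagging(n_estimators=10, seed=0)
correct = total = 0
for x, y in make_stream(500):
    correct += int(ensemble.predict_one(x) == y)
    total += 1
    ensemble.learn_one(x, y)
accuracy = correct / total
print(f"prequential accuracy on the toy stream: {accuracy:.3f}")
```

In a setup closer to the one described, the stand-in learner would be replaced by a Hoeffding Tree from a stream-learning library, and an oversampling step would rebalance the minority classes before each `learn_one` call.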

Keywords:
Oversampling; Computer science; Concept drift; Data stream; Artificial intelligence; Machine learning; Classifier (UML); Preprocessor; Data mining; Data pre-processing; Algorithm; Multiclass classification; One-class classification; Synthetic data; Data classification; Data stream mining; Pattern recognition (psychology); Support vector machine; Bandwidth (computing)

Metrics

Cited By: 3
FWCI (Field Weighted Citation Impact): 0.59
Refs: 27
Citation Normalized Percentile: 0.67

Topics

Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Anomaly Detection Techniques and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Time Series Analysis and Forecasting
Physical Sciences →  Computer Science →  Signal Processing

Related Documents

JOURNAL ARTICLE

Clustering-Based Oversampling Algorithm for Multi-class Imbalance Learning

Haixia Zhao, Jian Wu

Journal: Journal of Classification Year: 2024 Vol: 42 (1) Pages: 205-220

JOURNAL ARTICLE

Sampling Safety Coefficient for Multi-class Imbalance Oversampling Algorithm

Minggang Liu

Journal: DOAJ (DOAJ: Directory of Open Access Journals) Year: 2020