JOURNAL ARTICLE

Classifier Ensemble for Imbalanced Data Stream Classification

Abstract

Data streams in many real-life applications are characterized by concept drift. Such streams may also have skewed, or imbalanced, class distributions, as in financial fraud detection and network intrusion detection. A skewed class distribution compounds the difficulty of classifying stream instances: a classifier learned from such a stream is biased towards the majority class and therefore tends to misclassify minority class examples. In applications such as financial fraud detection, identifying fraudulent transactions is the main goal, because misclassifying these minority class instances results in financial loss. Likewise, in many other real-life data stream applications the misclassification costs associated with minority class instances are higher, so these instances need proactive treatment. In this paper we present preliminary work in which we propose a method that uses k nearest neighbours and an oversampling technique to balance the class distributions. Experimental results show good classification performance on synthetic and real-world data sets.
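
The abstract does not say how the k nearest neighbours and the oversampling step are combined, so the following is only a minimal sketch of one common way to pair them: SMOTE-style interpolation, applied to a single chunk of the stream. It is an assumption for illustration, not the authors' method. The function names `knn_oversample` and `balance_chunk`, the chunk-wise processing, and the parameters `k` and `seed` are all hypothetical.

```python
import math
import random


def knn_oversample(minority, n_new, k=3, seed=0):
    """Return n_new synthetic points interpolated between minority neighbours.

    Assumption: SMOTE-style interpolation; the paper may do this differently.
    """
    if n_new <= 0:
        return []
    if len(minority) < 2:
        raise ValueError("need at least two minority examples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other points
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority points by Euclidean distance to the base point.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        # Place the new point at a random position on the segment base -> neighbour.
        gap = rng.random()
        synthetic.append(tuple(b + gap * (n - b) for b, n in zip(base, neighbour)))
    return synthetic


def balance_chunk(majority, minority, k=3, seed=0):
    """Pad the minority class of one stream chunk up to the majority class size."""
    deficit = len(majority) - len(minority)
    return list(minority) + knn_oversample(minority, deficit, k, seed)
```

In a streaming setting, a function like `balance_chunk` would be called on each incoming chunk before a classifier is trained on it. How the paper handles concept drift across chunks is not described in the abstract and is left out of this sketch.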

Keywords:
Computer science; Data stream; Classifier (UML); Concept drift; Oversampling; Data stream mining; Artificial intelligence; Data mining; Machine learning; Intrusion detection system; Bandwidth (computing)

Metrics

Cited By: 12
FWCI (Field Weighted Citation Impact): 1.52
Refs: 26
Citation Normalized Percentile: 0.86

Topics

Data Stream Mining Techniques
Physical Sciences → Computer Science → Artificial Intelligence
Imbalanced Data Classification Techniques
Physical Sciences → Computer Science → Artificial Intelligence
Anomaly Detection Techniques and Applications
Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

BOOK-CHAPTER

Classifier Ensemble for Uncertain Data Stream Classification

Shirui Pan, Kuan Wu, Yang Zhang, Xue Li

Lecture Notes in Computer Science Year: 2010 Pages: 488-495

JOURNAL ARTICLE

An Ensemble Tree Classifier for Highly Imbalanced Data Classification

Peibei Shi, Zhong Wang

Journal: Journal of Systems Science and Complexity Year: 2021 Vol: 34 (6) Pages: 2250-2266

JOURNAL ARTICLE

Hellinger Distance Weighted Ensemble for imbalanced data stream classification

Joanna Grzyb, Jakub Klikowski, Michał Woźniak

Journal: Journal of Computational Science Year: 2021 Vol: 51 Pages: 101314-101314