Cost-sensitive learning for imbalanced data streams

Lucas Loezer; Fabrício Enembreck; Jean Paul Barddal; Alceu de Souza Britto

doi:10.1145/3341105.3373949

ScienceGate Book Chapters

JOURNAL ARTICLE

Cost-sensitive learning for imbalanced data streams

Lucas Loezer Fabrício Enembreck Jean Paul Barddal Alceu de Souza Britto

Year: 2020 Pages: 498-504

DOI: 10.1145/3341105.3373949

Get Full-Text PDF Get Analytical Report

Abstract

The data imbalance problem hampers the classification task. In streaming environments, this becomes even more cumbersome as the proportion of classes can vary over time. Approaches based on misclassification costs can be used to mitigate this problem. In this paper, we present the Cost-sensitive Adaptive Random Forest (CSARF) and compare it to the Adaptive Random Forest (ARF) and ARF with Resampling (ARFRE) in six real-world and six synthetic data sets with different class ratios. The empirical study analyzes two misclassification costs strategies of the CSARF and shows that the CSARF obtained statistically superior w.r.t. the average recall and average F1 when compared to ARF.

Keywords:

Resampling Computer science Random forest Task (project management) Machine learning Artificial intelligence Data stream Data stream mining Recall Data mining Engineering

Metrics

Cited By

2.94

FWCI (Field Weighted Citation Impact)

Refs

0.92

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Data Stream Mining Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Imbalanced Data Classification Techniques

Physical Sciences → Computer Science → Artificial Intelligence

Water Systems and Optimization

Physical Sciences → Engineering → Civil and Structural Engineering

Cost-sensitive learning for imbalanced data streams

Abstract

Metrics

Citation History

Topics

Related Documents

Cost-sensitive sparse group online learning for imbalanced data streams

Cost-sensitive continuous ensemble kernel learning for imbalanced data streams with concept drift

Cost-sensitive learning methods for imbalanced data

Cost-Sensitive Perceptron Decision Trees for Imbalanced Drifting Data Streams

Analysis of imbalanced data using cost-sensitive learning