Targeted data augmentation for improving model robustness

Agnieszka Mikołajczyk-Bareła; Maria Ferlin; Michał Grochowski

doi:10.61822/amcs-2025-0011

ScienceGate Book Chapters

JOURNAL ARTICLE

Targeted data augmentation for improving model robustness

Agnieszka Mikołajczyk-Bareła Maria Ferlin Michał Grochowski

Year: 2025 Journal: International Journal of Applied Mathematics and Computer Science Vol: 35 (1) Publisher: De Gruyter Open

DOI: 10.61822/amcs-2025-0011

Get Full-Text PDF Get Analytical Report

Abstract

This paper proposes a new and effective bias mitigation method called targeted data augmentation (TDA). Since removing biases is often tedious and challenging and may not always lead to effective bias mitigation, we propose an alternative approach: skillfully inserting biases during the training to improve model robustness. To validate the proposed method, we applied TDA to two representative and diverse datasets: a clinical skin lesion dataset and a dataset of male and female faces. We identified and manually annotated existing instrument and sampling biases in these datasets, explicitly focusing on black frames and ruler marks in the skin lesion dataset and glasses in the face dataset. Using the counterfactual bias insertion (CBI) method, we confirmed that these biases strongly affect the model performance. By randomly inserting identified biases into training samples, we demonstrated that TDA significantly reduced bias measures by two times to more than 50 times, with only a negligible increase in the error rate. We performed our research on three model families: EfficientNet, DenseNet and Vision Transformer.

Keywords:

Robustness (evolution) Computer science Biology

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.03

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Anomaly Detection Techniques and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Targeted data augmentation for improving model robustness

Abstract

Metrics

Topics

Related Documents

Improving Image Classification Robustness Using Predictive Data Augmentation

Improving Classifier Robustness through Active Generative Counterfactual Data Augmentation

AROID: Improving Adversarial Robustness Through Online Instance-Wise Data Augmentation

Improving Robustness to out-of-Distribution Data by Frequency-Based Augmentation

Improving Robustness of Defect Detection models using Adversarial-based Data Augmentation