JOURNAL ARTICLE

On how to avoid exacerbating spurious correlations when models are overparameterized

Tina BehniaKe WangChristos Thrampoulidis

Year: 2022 Journal:   2022 IEEE International Symposium on Information Theory (ISIT)

Abstract

Overparameterized learning architectures fail to generalize well in the presence of data imbalance even when combined with traditional techniques for mitigating imbalances. This paper focuses on imbalanced classification datasets, in which a small subset of the population - a minority - may contain features that correlate spuriously with the class label. For a parametric family of cross-entropy loss modifications and a representative Gaussian mixture model, we derive non-asymptotic generalization bounds on the worst-group error that shed light on the role of different hyper-parameters. Specifically, we prove that, when appropriately tuned, the recently proposed VS-loss learns a model that is fair towards minorities even when spurious features are strong. On the other hand, alternative heuristics, such as the weighted CE and the LA-loss, can fail dramatically. Compared to previous works, our bounds hold for more general models, they are non-asymptotic, and, they apply even at scenarios of extreme imbalance.

Keywords:
Spurious relationship Generalization Heuristics Computer science Gaussian Parametric statistics Population Parametric model Entropy (arrow of time) Overfitting Cross entropy Algorithm Artificial intelligence Machine learning Mathematical optimization Mathematics Pattern recognition (psychology) Artificial neural network Statistics

Metrics

1
Cited By
0.12
FWCI (Field Weighted Citation Impact)
60
Refs
0.27
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Machine Learning and Data Classification
Physical Sciences →  Computer Science →  Artificial Intelligence
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Statistical Methods and Inference
Physical Sciences →  Mathematics →  Statistics and Probability

Related Documents

JOURNAL ARTICLE

Unlearning from experience to avoid spurious correlations

Jeff MitchellJesús Martínez del RincónNiall McLaughlin

Journal:   Pattern Analysis and Applications Year: 2025 Vol: 28 (4)
JOURNAL ARTICLE

Use of Weight-Adjusted Oxygen Uptake Scores that Avoid Spurious Correlations

Victor L. KatchFrank I. Katch

Journal:   Research Quarterly American Alliance for Health Physical Education and Recreation Year: 1974 Vol: 45 (4)Pages: 447-451
JOURNAL ARTICLE

Spurious correlations

Journal:   Mitteilungen der Deutschen Mathematiker-Vereinigung Year: 2014 Vol: 22 (3)
JOURNAL ARTICLE

Spurious correlations

J. Samuel Smart

Journal:   Physics Today Year: 1981 Vol: 34 (5)Pages: 15-15
BOOK-CHAPTER

Spurious Correlations

The MIT Press eBooks Year: 2022 Pages: 103-120
© 2026 ScienceGate Book Chapters — All rights reserved.