JOURNAL ARTICLE

Identifying Spurious Correlations for Robust Text Classification

Abstract

The predictions of text classifiers are often driven by spurious correlations – e.g., the term “Spielberg” correlates with positively reviewed movies, even though the term itself does not semantically convey a positive sentiment. In this paper, we propose a method to distinguish spurious and genuine correlations in text classification. We treat this as a supervised classification problem, using features derived from treatment effect estimators to distinguish spurious correlations from “genuine” ones. Due to the generic nature of these features and their small dimensionality, we find that the approach works well even with limited training examples, and that it is possible to transport the word classifier to new domains. Experiments on four datasets (sentiment classification and toxicity detection) suggest that using this approach to inform feature selection also leads to more robust classification, as measured by improved worst-case accuracy on the samples affected by spurious correlations.

Keywords:
Spurious relationship Computer science Artificial intelligence Classifier (UML) Curse of dimensionality Pattern recognition (psychology) Estimator Feature selection Machine learning Mathematics Statistics

Metrics

56
Cited By
6.90
FWCI (Field Weighted Citation Impact)
39
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Robust Interpretable Text Classification against Spurious Correlations Using AND-rules with Negation

Rohan Kumar YadavLei JiaoOle‐Christoffer GranmoMorten Goodwin

Journal:   Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence Year: 2022 Pages: 4439-4446
JOURNAL ARTICLE

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Zhao WangAron Culotta

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2021 Vol: 35 (16)Pages: 14024-14031
JOURNAL ARTICLE

Feature selection in text classification: Identifying spurious words with causal inference methods

Zixuan ZhaoHanyu LiJingke ChenYang LiJiayun Song

Journal:   Applied and Computational Engineering Year: 2023 Vol: 6 (1)Pages: 1522-1532
© 2026 ScienceGate Book Chapters — All rights reserved.