Identifying Spurious Correlations for Robust Text Classification

Zhao Wang; Aron Culotta

doi:10.18653/v1/2020.findings-emnlp.308

JOURNAL ARTICLE

Year: 2020 Pages: 3431-3440

Abstract

The predictions of text classifiers are often driven by spurious correlations – e.g., the term “Spielberg” correlates with positively reviewed movies, even though the term itself does not semantically convey a positive sentiment. In this paper, we propose a method to distinguish spurious and genuine correlations in text classification. We treat this as a supervised classification problem, using features derived from treatment effect estimators to distinguish spurious correlations from “genuine” ones. Due to the generic nature of these features and their small dimensionality, we find that the approach works well even with limited training examples, and that it is possible to transport the word classifier to new domains. Experiments on four datasets (sentiment classification and toxicity detection) suggest that using this approach to inform feature selection also leads to more robust classification, as measured by improved worst-case accuracy on the samples affected by spurious correlations.
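To make the idea of treatment-effect features concrete, here is a minimal sketch. The abstract does not say which estimators the authors use, so the nearest-neighbour matching below is a stand-in chosen for illustration, and the toy reviews, function names, and Jaccard similarity are all assumptions rather than details from the paper. For each word it computes two numbers: the raw association with the label, and a matched estimate that compares each document containing the word with its most similar document lacking it.

```python
from statistics import mean

# Toy labelled reviews (1 = positive, 0 = negative); invented for illustration.
DOCS = [
    ("a great spielberg movie", 1),
    ("a great movie", 1),
    ("great fun from spielberg", 1),
    ("great fun", 1),
    ("a dull spielberg movie", 0),
    ("a dull movie", 0),
    ("dull plot", 0),
    ("boring and dull", 0),
]


def tokens(text):
    """Split a document into a set of whitespace-separated words."""
    return set(text.split())


def jaccard(a, b):
    """Jaccard similarity between two word sets."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def raw_association(word, docs):
    """Difference in mean label between documents with and without `word`."""
    with_w = [y for text, y in docs if word in tokens(text)]
    without_w = [y for text, y in docs if word not in tokens(text)]
    return mean(with_w) - mean(without_w)


def matched_effect(word, docs):
    """Mean label difference between each document containing `word` and
    its most similar document (by remaining context) that lacks `word`."""
    treated = [(tokens(t) - {word}, y) for t, y in docs if word in tokens(t)]
    control = [(tokens(t), y) for t, y in docs if word not in tokens(t)]
    diffs = []
    for context, y in treated:
        _, y_match = max(control, key=lambda c: jaccard(context, c[0]))
        diffs.append(y - y_match)
    return mean(diffs)


def word_features(word, docs):
    """Low-dimensional feature vector describing one word (hypothetical)."""
    return {
        "raw": raw_association(word, docs),
        "matched": matched_effect(word, docs),
    }


if __name__ == "__main__":
    for w in ("spielberg", "great"):
        print(w, word_features(w, DOCS))
```

On this toy data, "spielberg" has a positive raw association but a matched estimate of zero, while "great" keeps a large matched estimate. Feature vectors of this kind, paired with a small set of words hand-labelled as spurious or genuine, are what a word-level classifier in the spirit of the abstract would be trained on. The sketch assumes every word appears in some but not all documents; otherwise `mean` raises an error on an empty list.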

Keywords:

Spurious relationship; Computer science; Artificial intelligence; Classifier (UML); Curse of dimensionality; Pattern recognition (psychology); Estimator; Feature selection; Machine learning; Mathematics; Statistics

Metrics

FWCI (Field Weighted Citation Impact): 6.90

Citation Normalized Percentile: 0.97

Topics

Sentiment Analysis and Opinion Mining

Physical Sciences → Computer Science → Artificial Intelligence

Topic Modeling

Physical Sciences → Computer Science → Artificial Intelligence

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Robust Interpretable Text Classification against Spurious Correlations Using AND-rules with Negation

Analyzing Biases to Spurious Correlations in Text Classification Tasks

Fighting Spurious Correlations in Text Classification via a Causal Learning Perspective

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Feature selection in text classification: Identifying spurious words with causal inference methods