Analyzing Biases to Spurious Correlations in Text Classification Tasks

Adian Liusie; Vatsal Raina; Vyas Raina; Mark Gales

doi:10.17863/cam.88892.2

JOURNAL ARTICLE

Year: 2022 Pages: 78-84

Abstract

Machine learning systems have shown impressive performance across a range of natural language tasks. However, it has been hypothesized that these systems are prone to learning spurious correlations that may be present in the training data. Though these correlations will not impact in-domain performance, they are unlikely to generalize well to out-of-domain data, limiting the applicability of systems. This work examines this phenomenon on text classification tasks. Rather than artificially injecting features into the data, we demonstrate that real spurious correlations can be exploited by current state-of-the-art deep-learning systems. Specifically, we show that even when only ‘stop’ words are available at the input stage, it is possible to predict the class significantly better than random. Though it is shown that these stop words are not required for good in-domain performance, they can degrade the ability of the system to generalize well to out-of-domain data.
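To make the stop-word-only input condition concrete, the following is a minimal sketch of how a text could be reduced to its stop words before being passed to a classifier. It is an illustration under stated assumptions, not the authors' implementation: the stop-word list `STOP_WORDS`, the regex tokenizer, and the function names `keep_only_stop_words` and `drop_stop_words` are all hypothetical, and the abstract does not specify which list or tokenization was used.

```python
import re

# Hypothetical, deliberately tiny stop-word list for illustration only.
# The list used in the paper is not given in the abstract.
STOP_WORDS = frozenset({
    "a", "an", "and", "the", "is", "it", "was", "of", "to", "in",
    "but", "not", "this", "that",
})


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphabetic tokens (assumed tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


def keep_only_stop_words(text: str) -> str:
    """Return the text with every non-stop-word token removed."""
    return " ".join(t for t in tokenize(text) if t in STOP_WORDS)


def drop_stop_words(text: str) -> str:
    """Return the text with every stop-word token removed (the complementary view)."""
    return " ".join(t for t in tokenize(text) if t not in STOP_WORDS)


if __name__ == "__main__":
    review = "The plot was thin, but the acting is not bad."
    print(keep_only_stop_words(review))
    print(drop_stop_words(review))
```

Training one classifier on the output of `keep_only_stop_words` and another on the output of `drop_stop_words` would give the two input conditions the abstract contrasts: stop words alone, and content words alone.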

Keywords:

Spurious relationship; Computer science; Artificial intelligence; Data mining; Statistics; Machine learning; Mathematics

Metrics

FWCI (Field Weighted Citation Impact): 0.78

Citation Normalized Percentile: 0.73

Topics

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Identifying Spurious Correlations for Robust Text Classification

Fighting Spurious Correlations in Text Classification via a Causal Learning Perspective

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Robust Interpretable Text Classification against Spurious Correlations Using AND-rules with Negation

Explore Spurious Correlations at the Concept Level in Language Models for Text Classification