JOURNAL ARTICLE

Feature Selection as Causal Inference: Experiments with Text Classification

Abstract

This paper proposes a matching technique for learning causal associations between word features and class labels in document classification. The goal is to identify more meaningful and generalizable features than with only correlational approaches. Experiments with sentiment classification show that the proposed method identifies interpretable word associations with sentiment and improves classification performance in a majority of cases. The proposed feature selection method is particularly effective when applied to out-of-domain data.

Keywords:
Artificial intelligence Computer science Feature selection Matching (statistics) Selection (genetic algorithm) Word (group theory) Feature (linguistics) Inference Class (philosophy) Machine learning Domain (mathematical analysis) Pattern recognition (psychology) Feature extraction Natural language processing Mathematics Statistics

Metrics

36
Cited By
3.67
FWCI (Field Weighted Citation Impact)
46
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Machine Learning and Algorithms
Physical Sciences →  Computer Science →  Artificial Intelligence
Bayesian Modeling and Causal Inference
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Feature selection in text classification: Identifying spurious words with causal inference methods

Zixuan ZhaoHanyu LiJingke ChenYang LiJiayun Song

Journal:   Applied and Computational Engineering Year: 2023 Vol: 6 (1)Pages: 1522-1532
JOURNAL ARTICLE

Feature selection with applications to text classification

Endre Boros

Journal:   Rutgers University Community Repository (Rutgers University) Year: 2012
BOOK-CHAPTER

Feature Selection for Text Classification

George Forman

Chapman & Hall/CRC data mining and knowledge discovery series Year: 2007 Pages: 257-276
© 2026 ScienceGate Book Chapters — All rights reserved.