Feature Selection as Causal Inference: Experiments with Text Classification

Michael J. Paul

doi:10.18653/v1/k17-1018

ScienceGate Book Chapters

JOURNAL ARTICLE

Feature Selection as Causal Inference: Experiments with Text Classification

Michael J. Paul

Year: 2017 Pages: 163-172

DOI: 10.18653/v1/k17-1018

Get Full-Text PDF Get Analytical Report

Abstract

This paper proposes a matching technique for learning causal associations between word features and class labels in document classification. The goal is to identify more meaningful and generalizable features than with only correlational approaches. Experiments with sentiment classification show that the proposed method identifies interpretable word associations with sentiment and improves classification performance in a majority of cases. The proposed feature selection method is particularly effective when applied to out-of-domain data.

Keywords:

Artificial intelligence Computer science Feature selection Matching (statistics) Selection (genetic algorithm) Word (group theory) Feature (linguistics) Inference Class (philosophy) Machine learning Domain (mathematical analysis) Pattern recognition (psychology) Feature extraction Natural language processing Mathematics Statistics

Metrics

Cited By

3.67

FWCI (Field Weighted Citation Impact)

Refs

0.93

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Machine Learning and Algorithms

Physical Sciences → Computer Science → Artificial Intelligence

Bayesian Modeling and Causal Inference

Physical Sciences → Computer Science → Artificial Intelligence

Text and Document Classification Technologies

Physical Sciences → Computer Science → Artificial Intelligence

Feature Selection as Causal Inference: Experiments with Text Classification

Abstract

Metrics

Citation History

Topics

Related Documents

Feature selection in text classification: Identifying spurious words with causal inference methods

Feature Selection for High Dimensional Causal Inference

Feature selection with applications to text classification

Feature selection in text classification

Feature Selection for Text Classification