Semi-supervised counterfactual explanations

Surya Shravan Kumar Sajja; Sumanta Mukherjee; Satyam Dwivedi

doi:10.48550/arxiv.2303.12634

Year: 2023; Journal: arXiv (Cornell University); Publisher: Cornell University

Abstract

Counterfactual explanations for machine learning models identify minimal interventions on the feature values that change the model's prediction to a different output or to a specified target output. A valid counterfactual explanation should have likely feature values. Here, we address the challenge of generating counterfactual explanations that lie in the same data distribution as the training data and, more importantly, belong to the target class distribution. This requirement has been addressed by incorporating an auto-encoder reconstruction loss into the counterfactual search process. Connecting the output behavior of the classifier to the latent space of the auto-encoder has further improved the speed of the counterfactual search and the interpretability of the resulting explanations. Continuing this line of research, we show a further improvement in the interpretability of counterfactual explanations when the auto-encoder is trained in a semi-supervised fashion with class-tagged input data. We empirically evaluate our approach on several datasets and show considerable improvement in terms of several metrics.
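
The idea of adding an auto-encoder reconstruction loss to the counterfactual search can be illustrated with a small toy sketch. It is not the authors' implementation: the classifier is assumed to be a fixed logistic model, the auto-encoder is replaced by a linear projection onto a one-dimensional subspace, and all names (`find_counterfactual`, `dist_weight`, `recon_weight`) and values are hypothetical. The semi-supervised training of the auto-encoder described in the abstract is not shown.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def reconstruction_error(x, P, mu):
    """Distance between x and its reconstruction mu + P (x - mu).

    P is assumed to be an orthogonal projector that stands in for a
    trained auto-encoder (encode, then decode).
    """
    residual = (np.eye(len(x)) - P) @ (x - mu)
    return float(np.linalg.norm(residual))

def find_counterfactual(x, w, b, P, mu, target=1.0,
                        dist_weight=0.05, recon_weight=1.0,
                        lr=0.1, steps=2000):
    """Gradient descent on the sum of three terms:

    cross-entropy between the classifier output and `target`
    + dist_weight  * ||x_cf - x||^2           (stay close to the input)
    + recon_weight * ||x_cf - AE(x_cf)||^2    (stay near the data manifold)
    """
    x = np.asarray(x, dtype=float)
    R = np.eye(len(x)) - P          # residual operator of the linear auto-encoder
    x_cf = x.copy()
    for _ in range(steps):
        p = sigmoid(w @ x_cf + b)
        grad_pred = (p - target) * w           # gradient of the cross-entropy term
        grad_dist = 2.0 * (x_cf - x)           # gradient of the distance term
        # R is symmetric and idempotent, so the gradient of ||R (x_cf - mu)||^2
        # simplifies to 2 R (x_cf - mu).
        grad_recon = 2.0 * R @ (x_cf - mu)
        x_cf = x_cf - lr * (grad_pred
                            + dist_weight * grad_dist
                            + recon_weight * grad_recon)
    return x_cf

# Toy setup (all values are illustrative assumptions):
# the data are assumed to lie along the direction (1, 1) through the origin.
v = np.array([1.0, 1.0]) / np.sqrt(2.0)
P = np.outer(v, v)                 # projector onto the assumed data subspace
mu = np.zeros(2)                   # assumed data mean
w = np.array([2.0, 0.0])           # logistic classifier weights
b = 0.0                            # logistic classifier bias
x = np.array([-1.0, -1.0])         # input currently classified as class 0

x_cf_plain = find_counterfactual(x, w, b, P, mu, recon_weight=0.0)
x_cf_ae = find_counterfactual(x, w, b, P, mu, recon_weight=1.0)
```

In this toy setting both searches flip the prediction to the target class, but the search with the reconstruction term ends closer to the assumed data subspace, which is the sense in which the reconstruction loss encourages likely feature values.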

Keywords:

Counterfactual thinking; Interpretability; Machine learning; Artificial intelligence; Computer science; Classifier (UML); Feature (linguistics); Encoder; Econometrics; Mathematics; Psychology; Social psychology

Topics

Explainable Artificial Intelligence (XAI)

Physical Sciences → Computer Science → Artificial Intelligence

Machine Learning and Data Classification

Physical Sciences → Computer Science → Artificial Intelligence

Machine Learning in Materials Science

Physical Sciences → Materials Science → Materials Chemistry

Related Documents

Counterfactual Propagation for Semi-supervised Individual Treatment Effect Estimation

On Generating Plausible Counterfactual and Semi-Factual Explanations for Deep Learning

Latent Diffusion Counterfactual Explanations

Counterfactual Shapley Additive Explanations