JOURNAL ARTICLE

Semi-supervised counterfactual explanations

Abstract

Counterfactual explanations for machine learning models identify minimal interventions on the feature values that change the model's prediction to a different output or to a specified target output. A valid counterfactual explanation should have likely feature values. Here, we address the challenge of generating counterfactual explanations that lie in the same data distribution as the training data and, more importantly, belong to the target class distribution. This requirement has been addressed by incorporating an auto-encoder reconstruction loss into the counterfactual search process. Connecting the output behavior of the classifier to the latent space of the auto-encoder has further improved the speed of the counterfactual search and the interpretability of the resulting explanations. Continuing this line of research, we show a further improvement in the interpretability of counterfactual explanations when the auto-encoder is trained in a semi-supervised fashion with class-tagged input data. We empirically evaluate our approach on several datasets and show considerable improvement in terms of several metrics.
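
To make the search described in the abstract concrete, the following is a minimal sketch and not the authors' implementation. It assumes a logistic classifier and uses a fixed linear projection as a stand-in for a trained auto-encoder fitted on target-class data. The names counterfactual_search, recon_error, lam, gamma, P, and mu are hypothetical, and the loss weights are arbitrary choices for a toy example.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def recon_error(x, P, mu):
    # Squared reconstruction error of the stand-in linear auto-encoder
    # AE(x) = mu + P (x - mu), where P is an orthogonal projector (assumption).
    r = (x - mu) - P @ (x - mu)
    return float(r @ r)

def counterfactual_search(x, w, b, P, mu, target=1.0,
                          lam=0.1, gamma=0.5, lr=0.1, steps=500):
    """Gradient descent on the illustrative objective
        BCE(f(x'), target) + lam * ||x' - x||^2 + gamma * ||x' - AE(x')||^2
    with f(x') = sigmoid(w . x' + b)."""
    x = np.asarray(x, dtype=float)
    x_cf = x.copy()
    R = np.eye(len(x)) - P  # residual operator: x' - AE(x') = R (x' - mu)
    for _ in range(steps):
        p = sigmoid(w @ x_cf + b)
        grad_pred = (p - target) * w                          # prediction term
        grad_dist = 2.0 * lam * (x_cf - x)                    # stay close to x
        grad_rec = 2.0 * gamma * R.T @ (R @ (x_cf - mu))      # stay near the data
        x_cf = x_cf - lr * (grad_pred + grad_dist + grad_rec)
    return x_cf

# Toy setup (all values are assumptions chosen for illustration).
w, b = np.array([1.0, 1.0]), 0.0          # classifier parameters
x = np.array([-1.0, -1.0])                # input currently predicted as class 0
mu = np.array([1.0, 1.0])                 # centre of the target-class data
P = np.array([[1.0, 0.0], [0.0, 0.0]])    # target-class data varies along axis 0

x_plain = counterfactual_search(x, w, b, P, mu, gamma=0.0)  # no reconstruction loss
x_ae = counterfactual_search(x, w, b, P, mu, gamma=0.5)     # with reconstruction loss
```

In this toy setting, both searches flip the prediction, but the one with gamma > 0 ends closer to the subspace that the stand-in auto-encoder reconstructs well. A semi-supervised auto-encoder trained on class-tagged data would take the place of the fixed projection used here.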

Keywords:
Counterfactual thinking, Interpretability, Machine learning, Artificial intelligence, Computer science, Classifier (UML), Feature (linguistics), Encoder, Econometrics, Mathematics, Psychology, Social psychology

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0

Topics

Explainable Artificial Intelligence (XAI)
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning and Data Classification
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning in Materials Science
Physical Sciences →  Materials Science →  Materials Chemistry

Related Documents

JOURNAL ARTICLE

On Generating Plausible Counterfactual and Semi-Factual Explanations for Deep Learning

Eoin M. Kenny, Mark T. Keane

Journal:   arXiv (Cornell University) Year: 2020 Pages: 11575-11585
JOURNAL ARTICLE

On Generating Plausible Counterfactual and Semi-Factual Explanations for Deep Learning

Eoin M. Kenny, Mark T. Keane

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2021 Vol: 35 (13) Pages: 11575-11585
BOOK-CHAPTER

Latent Diffusion Counterfactual Explanations

Simon Schrodi, Karim Farid, Max Argus, Thomas Brox

Lecture notes in computer science Year: 2025 Pages: 295-311
JOURNAL ARTICLE

Counterfactual Shapley Additive Explanations

Emanuele Albini, Jason Long, Danial Dervovic, Daniele Magazzeni

Journal:   2022 ACM Conference on Fairness, Accountability, and Transparency Year: 2022 Pages: 1054-1070