Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations

Pau Rodríguez; M. Caccia; Alexandre Lacoste; Lee Zamparo; Issam Laradji; Laurent Charlin; David Vázquez

doi:10.1109/iccv48922.2021.00109

ScienceGate Book Chapters

JOURNAL ARTICLE

Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations

Pau Rodríguez M. Caccia Alexandre Lacoste Lee Zamparo Issam Laradji Laurent Charlin David Vázquez

Year: 2021 Journal: 2021 IEEE/CVF International Conference on Computer Vision (ICCV) Pages: 1036-1045

DOI: 10.1109/iccv48922.2021.00109

Get Full-Text PDF Get Analytical Report

Abstract

Explainability for machine learning models has gained considerable attention within the research community given the importance of deploying more reliable machine-learning systems. In computer vision applications, generative counterfactual methods indicate how to perturb a model's input to change its prediction, providing details about the model's decision-making. Current methods tend to generate trivial counterfactuals about a model's decisions, as they often suggest to exaggerate or remove the presence of the attribute being classified. For the machine learning practitioner, these types of counterfactuals offer little value, since they provide no new information about undesired model or data biases. In this work, we identify the problem of trivial counterfactual generation and we propose DiVE to alleviate it. DiVE learns a perturbation in a disentangled latent space that is constrained using a diversity-enforcing loss to uncover multiple valuable explanations about the model's prediction. Further, we introduce a mechanism to prevent the model from producing trivial explanations. Experiments on CelebA and Synbols demonstrate that our model improves the success rate of producing high-quality valuable explanations when compared to previous state-of-the-art methods. Code is available at https://github.com/ElementAI/beyond-trivial-explanations.

Keywords:

Counterfactual thinking Computer science Data science Epistemology Philosophy

Metrics

Cited By

3.92

FWCI (Field Weighted Citation Impact)

108

Refs

0.95

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Explainable Artificial Intelligence (XAI)

Physical Sciences → Computer Science → Artificial Intelligence

Adversarial Robustness in Machine Learning

Physical Sciences → Computer Science → Artificial Intelligence

Scientific Computing and Data Management

Social Sciences → Decision Sciences → Information Systems and Management

Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations

Abstract

Metrics

Citation History

Topics

Related Documents

Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations

DiPACE: Diverse, Plausible and Actionable Counterfactual Explanations

Multi-Granular Evaluation of Diverse Counterfactual Explanations

Gradient-based Counterfactual Generation for Sparse and Diverse Counterfactual Explanations

From Visual Explanations to Counterfactual Explanations with Latent Diffusion