Empirical Analysis of Methods for Evaluating Faithfulness of Explanations by Feature Attribution

Yuya Asazuma; Kazuaki Hanawa; Kentaro Inui

doi:10.1527/tjsai.38-6_c-n22

ScienceGate Book Chapters

JOURNAL ARTICLE

Empirical Analysis of Methods for Evaluating Faithfulness of Explanations by Feature Attribution

Yuya Asazuma Kazuaki Hanawa Kentaro Inui

Year: 2023 Journal: Transactions of the Japanese Society for Artificial Intelligence Vol: 38 (6)Pages: C-N22_1 Publisher: The Japanese Society for Artificial Intelligence

DOI: 10.1527/tjsai.38-6_c-n22

Get Full-Text PDF Get Analytical Report

Abstract

Many high-performance machine learning models in the real world exhibit the black box problem. This issue is widely recognized as needing output reliability and model transparency. XAI (Explainable AI) represents a research field that addresses this issue. Within XAI, feature attribution methods, which clarify the importance of features irrespective of the task or model type, have become a central focus. Evaluating their efficacy based on empirical evidence is essential when proposing new methods. However, extensive debate exists regarding the properties that importance should be possessed, and a consensus on specific evaluation methods remains elusive. Given this context, many existing studies adopt their evaluation techniques, leading to fragmented discussions. This study aims to "evaluate the evaluation methods," focusing mainly on the faithfulness metric, deemed especially significant in evaluation criteria. We conducted empirical experiments related to existing evaluation techniques. The experiments approached the topic from two angles: correlation-based comparative evaluations and property verification using random sequences. In the former experiment, we investigated the correlation between faithfulness evaluation tests using numerous models and feature attribution methods. As a result, we found that very few test combinations exhibited high correlation, and many combinations showed low or no correlation. In the latter experiment, we observed that the measured faithfulness varied depending on the model and dataset by using random sequences instead of feature attribution methods to verify the properties of the faithfulness tests.

Keywords:

Computer science Feature (linguistics) Correlation Context (archaeology) Artificial intelligence Machine learning Empirical research Metric (unit) Reliability (semiconductor) Data mining Statistics Mathematics

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.14

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Explainable Artificial Intelligence (XAI)

Physical Sciences → Computer Science → Artificial Intelligence

Machine Learning and Data Classification

Physical Sciences → Computer Science → Artificial Intelligence

Empirical Analysis of Methods for Evaluating Faithfulness of Explanations by Feature Attribution

Abstract

Metrics

Topics

Related Documents

TextFocus: Assessing the Faithfulness of Feature Attribution Methods Explanations in Natural Language Processing

Plausibility and Faithfulness of Feature Attribution-Based Explanations in Automated Short Answer Scoring

Evaluating explainability in language classification models: A unified framework incorporating feature attribution methods and key factors affecting faithfulness

Evaluating Readability and Faithfulness of Concept-based Explanations

Can We Really Trust Explanations? Evaluating the Stability of Feature Attribution Explanation Methods via Adversarial Attack