JOURNAL ARTICLE

Empirical Analysis of Methods for Evaluating Faithfulness of Explanations by Feature Attribution

Yuya AsazumaKazuaki HanawaKentaro Inui

Year: 2023 Journal:   Transactions of the Japanese Society for Artificial Intelligence Vol: 38 (6)Pages: C-N22_1   Publisher: The Japanese Society for Artificial Intelligence

Abstract

Many high-performance machine learning models in the real world exhibit the black box problem. This issue is widely recognized as needing output reliability and model transparency. XAI (Explainable AI) represents a research field that addresses this issue. Within XAI, feature attribution methods, which clarify the importance of features irrespective of the task or model type, have become a central focus. Evaluating their efficacy based on empirical evidence is essential when proposing new methods. However, extensive debate exists regarding the properties that importance should be possessed, and a consensus on specific evaluation methods remains elusive. Given this context, many existing studies adopt their evaluation techniques, leading to fragmented discussions. This study aims to "evaluate the evaluation methods," focusing mainly on the faithfulness metric, deemed especially significant in evaluation criteria. We conducted empirical experiments related to existing evaluation techniques. The experiments approached the topic from two angles: correlation-based comparative evaluations and property verification using random sequences. In the former experiment, we investigated the correlation between faithfulness evaluation tests using numerous models and feature attribution methods. As a result, we found that very few test combinations exhibited high correlation, and many combinations showed low or no correlation. In the latter experiment, we observed that the measured faithfulness varied depending on the model and dataset by using random sequences instead of feature attribution methods to verify the properties of the faithfulness tests.

Keywords:
Computer science Feature (linguistics) Correlation Context (archaeology) Artificial intelligence Machine learning Empirical research Metric (unit) Reliability (semiconductor) Data mining Statistics Mathematics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
22
Refs
0.14
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Explainable Artificial Intelligence (XAI)
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning and Data Classification
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.