BOOK-CHAPTER

Can We Really Trust Explanations? Evaluating the Stability of Feature Attribution Explanation Methods via Adversarial Attack

Yang Zhao, Yuanzhe Zhang, Zhongtao Jiang, Yiming Ju, Jun Zhao, Kang Liu

Year: 2022
Series: Lecture Notes in Computer Science
Pages: 281-297
Publisher: Springer Science+Business Media
Keywords: Adversarial system, Computer science, Credibility, Stability (learning theory), Frame (networking), Feature (linguistics), Transparency (behavior), Trustworthiness, Artificial intelligence, Semantics (computer science), Attribution, Machine learning, Computer security, Epistemology, Linguistics

Metrics

Cited By: 3
FWCI (Field Weighted Citation Impact): 1.09
Refs: 29
Citation Normalized Percentile: 0.80

Topics

Adversarial Robustness in Machine Learning
Physical Sciences → Computer Science → Artificial Intelligence
Explainable Artificial Intelligence (XAI)
Physical Sciences → Computer Science → Artificial Intelligence
Scientific Computing and Data Management
Social Sciences → Decision Sciences → Information Systems and Management