Numerous methods have been developed to address the critical need to understand the behavior of AI systems. Arguably, the most popular are model-agnostic local explanation techniques, which focus on examining model behavior for individual instances. While several implementations have been proposed, comparatively less attention has been paid to assessing the robustness of the generated explanations and their transferability to unseen data. More importantly, most robustness analyses have focused on differentiable models and deep neural networks. In this paper, we analyze the robustness of two well-known model-agnostic explanation methods, LIME and SHAP, from a methodological perspective and propose a criterion to measure the transferability of explanations from the training to the testing phase. The proposed methodology therefore validates explanations not only in terms of model performance but also in terms of their robustness during the learning process. We conclude that SHAP explanations transfer better than LIME explanations on sparse or low-density data sets, while the opposite holds for very dense data sets. We also observe no significant differences among the results obtained when different machine learning models are combined with these two model-agnostic techniques.
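To make the idea of train-to-test transferability of explanations concrete, the sketch below compares LIME and SHAP feature attributions computed on training instances against those computed on test instances. This is an illustrative assumption rather than the criterion defined in the paper: attributions are aggregated as the mean absolute weight per feature and compared with Spearman rank correlation; the data set, model, and sample sizes are arbitrary choices for the example.

```python
# Hedged sketch: train/test agreement of LIME and SHAP attributions as a rough
# proxy for explanation transferability. The aggregation (mean |attribution|
# per feature, compared via Spearman rank correlation) is an assumption made
# for illustration, not the paper's actual criterion.
import numpy as np
from scipy.stats import spearmanr
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from lime.lime_tabular import LimeTabularExplainer
import shap

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)
model = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_tr, y_tr)

n_feat = X.shape[1]
# Single-output wrapper (probability of the positive class) so the
# model-agnostic SHAP explainer returns one attribution vector per instance.
predict_pos = lambda data: model.predict_proba(data)[:, 1]

def lime_importance(X_explain):
    """Mean absolute LIME weight per feature over a set of instances."""
    explainer = LimeTabularExplainer(X_tr, mode="classification")
    agg = np.zeros(n_feat)
    for x in X_explain:
        exp = explainer.explain_instance(x, model.predict_proba,
                                         num_features=n_feat, labels=(1,))
        for idx, weight in exp.as_map()[1]:
            agg[idx] += abs(weight)
    return agg / len(X_explain)

def shap_importance(X_explain):
    """Mean absolute SHAP value per feature (model-agnostic KernelExplainer)."""
    explainer = shap.KernelExplainer(predict_pos, shap.sample(X_tr, 50))
    values = explainer.shap_values(X_explain, nsamples=100)
    return np.abs(values).mean(axis=0)

# Small subsamples keep the (expensive) model-agnostic explainers tractable.
train_sample, test_sample = X_tr[:20], X_te[:20]

for name, importance in [("LIME", lime_importance), ("SHAP", shap_importance)]:
    rho, _ = spearmanr(importance(train_sample), importance(test_sample))
    print(f"{name}: train/test rank correlation of attributions = {rho:.3f}")
```

A higher rank correlation would indicate that the features an explainer emphasizes during training remain influential on unseen data; how such agreement is actually quantified in the paper is defined by its proposed criterion.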