Towards Global Explanations of Convolutional Neural Networks With Concept Attribution

Weibin Wu; Yuxin Su; Xixian Chen; Shenglin Zhao; Irwin King; Michael R. Lyu; Yu‐Wing Tai

doi:10.1109/cvpr42600.2020.00868

ScienceGate Book Chapters

JOURNAL ARTICLE

Towards Global Explanations of Convolutional Neural Networks With Concept Attribution

Weibin Wu Yuxin Su Xixian Chen Shenglin Zhao Irwin King Michael R. Lyu Yu‐Wing Tai

Year: 2020 Pages: 8649-8658

DOI: 10.1109/cvpr42600.2020.00868

Get Full-Text PDF Get Analytical Report

Abstract

With the growing prevalence of convolutional neural networks (CNNs), there is an urgent demand to explain their behaviors. Global explanations contribute to understanding model predictions on a whole category of samples, and thus have attracted increasing interest recently. However, existing methods overwhelmingly conduct separate input attribution or rely on local approximations of models, making them fail to offer faithful global explanations of CNNs. To overcome such drawbacks, we propose a novel two-stage framework, Attacking for Interpretability (AfI), which explains model decisions in terms of the importance of user-defined concepts. AfI first conducts a feature occlusion analysis, which resembles a process of attacking models to derive the category-wide importance of different features. We then map the feature importance to concept importance through ad-hoc semantic tasks. Experimental results confirm the effectiveness of AfI and its superiority in providing more accurate estimations of concept importance than existing proposals.

Keywords:

Interpretability Convolutional neural network Computer science Attribution Feature (linguistics) Artificial intelligence Process (computing) Machine learning Psychology

Metrics

Cited By

4.11

FWCI (Field Weighted Citation Impact)

Refs

0.94

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Explainable Artificial Intelligence (XAI)

Physical Sciences → Computer Science → Artificial Intelligence

Adversarial Robustness in Machine Learning

Physical Sciences → Computer Science → Artificial Intelligence

Machine Learning in Healthcare

Physical Sciences → Computer Science → Artificial Intelligence

Towards Global Explanations of Convolutional Neural Networks With Concept Attribution

Abstract

Metrics

Citation History

Topics

Related Documents

Neural Networks with Feature Attribution and Contrastive Explanations

Feature Attribution Explanations for Spiking Neural Networks

Concept Extraction with Convolutional Neural Networks

Concept Extraction with Convolutional Neural Networks

Global Explanations of Neural Networks