A concept-aware explainability method for convolutional neural networks

Mustafa Kağan Gürkan; Nafiz Arıca; Fatoş T. Yarman Vural

doi:10.1007/s00138-024-01653-w

ScienceGate Book Chapters

JOURNAL ARTICLE

A concept-aware explainability method for convolutional neural networks

Mustafa Kağan Gürkan Nafiz Arıca Fatoş T. Yarman Vural

Year: 2025 Journal: Machine Vision and Applications Vol: 36 (2) Publisher: Springer Science+Business Media

DOI: 10.1007/s00138-024-01653-w

Get Full-Text PDF Get Analytical Report

Abstract

Abstract Although Convolutional Neural Networks (CNN) outperform the classical models in a wide range of Machine Vision applications, their restricted interpretability and their lack of comprehensibility in reasoning, generate many problems such as security, reliability, and safety. Consequently, there is a growing need for research to improve explainability and address their limitations. In this paper, we propose a concept-based method, called Concept-Aware Explainability (CAE) to provide a verbal explanation for the predictions of pre-trained CNN models. A new measure, called detection score mean, is introduced to quantify the relationship between the filters of the model and a set of pre-defined concepts. Based on the detection score mean values, we define sorted lists of Concept-Aware Filters (CAF) and Filter-Activating Concepts (FAC). These lists are used to generate explainability reports, where we can explain, analyze, and compare models in terms of the concepts embedded in the image. The proposed explainability method is compared to the state-of-the-art methods to explain Resnet18 and VGG16 models, pre-trained on ImageNet and Places365-Standard datasets. Two popular metrics, namely, the number of unique detectors and the number of detecting filters, are used to make a quantitative comparison. Superior performances are observed for the suggested CAE, when compared to Network Dissection (NetDis) (Bau et al., in: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), 2017), Net2Vec (Fong and Vedaldi, in: Paper presented at IEEE conference on computer vision and pattern recognition (CVPR), 2018), and CLIP-Dissect (CLIP-Dis) (Oikarinen and Weng, in: The 11th international conference on learning representations (ICLR), 2023) methods.

Keywords:

Interpretability Computer science Convolutional neural network Artificial intelligence Set (abstract data type) Reliability (semiconductor) Machine learning Range (aeronautics) Filter (signal processing) Pattern recognition (psychology) Computer vision

Metrics

Cited By

4.82

FWCI (Field Weighted Citation Impact)

Refs

0.91

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Explainable Artificial Intelligence (XAI)

Physical Sciences → Computer Science → Artificial Intelligence

Adversarial Robustness in Machine Learning

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Neural Network Applications

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

A concept-aware explainability method for convolutional neural networks

Abstract

Metrics

Citation History

Topics

Related Documents

Towards Explainability of non-Convolutional Neural Networks

Explainability Methods for Graph Convolutional Neural Networks

Content-aware convolutional neural networks

Explainability of Convolutional Neural Networks for Dermatological Diagnosis

Layer factor analysis in convolutional neural networks for explainability