JOURNAL ARTICLE

Explainable Deep Learning Framework for Binary Corrosion Image Classification Using Grad-CAM

Muhammad Amir Imran AminudinMohd Na’im AbdullahFaizal MustaphaKee Kok EngMazli MustaphaAliyu Mustapha

Year: 2025 Journal:   Sensors Vol: 25 (22)Pages: 7070-7070   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Corrosion in metallic materials is a critical challenge in maintenance and safety, and traditional visual inspection methods are often time-consuming, labor-intensive, and dependent on human expertise, highlighting the need for more efficient and reliable solutions. Deep learning, particularly convolutional neural networks (CNNs), provides a promising approach by enabling automated and accurate image-based classification. This study investigates binary image classification of corrosion using four pre-trained CNN architectures, namely ResNet50, MobileNetV2, NASNetMobile, and EfficientNetV2B0, and integrates explainable artificial intelligence (XAI) techniques to provide interpretability and insight into each model’s decision-making process. A curated dataset of 4012 images, divided between corroded and non-corroded surfaces, was pre-processed, and augmented images resulted in a total of 9636 images used to train and evaluate the models. Performance was assessed through accuracy, confusion matrices, computational timing, receiver operating characteristic curves, precision–recall curves, and Cohen’s Kappa. In this paper, Gradient-weighted Class Activation Mapping (Grad-CAM) visualizations are incorporated as an XAI technique to provide interpretable insight into the model’s reasoning process, enabling clear identification of corrosion regions and offering justification for each prediction produced by the system. A key contribution of this work is the integration of Grad-CAM to enhance explainability. The results showed that EfficientNetV2B0 demonstrates stable training with minimal sign overfitting compared to other models. MobileNetV2 achieved the lowest time to train with the datasets given, and ResNet50 achieved the highest classification performance in terms of confusion matrix, with an accuracy of 96.58%. Through Grad-CAM reasoning, EfficientNetV2B0 shows a specific high activation towards corroded regions compared to the other three models that were evaluated.

Keywords:

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
40
Refs
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Related Documents

JOURNAL ARTICLE

Explainable Deep Learning Framework for Pneumonia Detection Using Grad-CAM

Journal:   International Research Journal of Modernization in Engineering Technology and Science Year: 2025
JOURNAL ARTICLE

AN EXPLAINABLE DEEP LEARNING FRAMEWORK FOR BREAST CANCER CLASSIFICATION USING EFFICIENTNETV2B0 AND GRAD-CAM

Diman HassanJane Haj Ali

Journal:   Science Journal of University of Zakho Year: 2026 Vol: 14 (1)
JOURNAL ARTICLE

An Explainable Deep Learning Framework for Agtron-Based Coffee Roast Classification Using Grad-CAM

Havva Hazel ARASYusuf EryesilMurat KOKLU

Journal:   Proceedings of international conference on intelligent systems and new applications. Year: 2025
© 2026 ScienceGate Book Chapters — All rights reserved.