JOURNAL ARTICLE

Interpretable Software Defect Prediction from Project Effort and Static Code Metrics

Susmita HaldarLuiz Fernando Capretz

Year: 2024 Journal:   Computers Vol: 13 (2)Pages: 52-52   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Software defect prediction models enable test managers to predict defect-prone modules and assist with delivering quality products. A test manager would be willing to identify the attributes that can influence defect prediction and should be able to trust the model outcomes. The objective of this research is to create software defect prediction models with a focus on interpretability. Additionally, it aims to investigate the impact of size, complexity, and other source code metrics on the prediction of software defects. This research also assesses the reliability of cross-project defect prediction. Well-known machine learning techniques, such as support vector machines, k-nearest neighbors, random forest classifiers, and artificial neural networks, were applied to publicly available PROMISE datasets. The interpretability of this approach was demonstrated by SHapley Additive exPlanations (SHAP) and local interpretable model-agnostic explanations (LIME) techniques. The developed interpretable software defect prediction models showed reliability on independent and cross-project data. Finally, the results demonstrate that static code metrics can contribute to the defect prediction models, and the inclusion of explainability assists in establishing trust in the developed models.

Keywords:
Interpretability Computer science Predictive modelling Machine learning Software quality Reliability (semiconductor) Software Random forest Software bug Artificial intelligence Data mining Software metric Support vector machine Code (set theory) Source code Quality (philosophy) Reliability engineering Software development Set (abstract data type) Engineering Programming language

Metrics

12
Cited By
18.33
FWCI (Field Weighted Citation Impact)
61
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Software Engineering Research
Physical Sciences →  Computer Science →  Information Systems
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software
Software System Performance and Reliability
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.