JOURNAL ARTICLE

Toward Comprehensible Software Fault Prediction Models Using Bayesian Network Classifiers

Karel DejaegerThomas VerbrakenBart Baesens

Year: 2012 Journal:   IEEE Transactions on Software Engineering Vol: 39 (2)Pages: 237-257   Publisher: IEEE Computer Society

Abstract

Software testing is a crucial activity during software development and fault prediction models assist practitioners herein by providing an upfront identification of faulty software code by drawing upon the machine learning literature. While especially the Naive Bayes classifier is often applied in this regard, citing predictive performance and comprehensibility as its major strengths, a number of alternative Bayesian algorithms that boost the possibility to construct simpler networks with less nodes and arcs remain unexplored. This study contributes to the literature by considering 15 different Bayesian Network (BN) classifiers and comparing them to other popular machine learning techniques. Furthermore, the applicability of the Markov blanket principle for feature selection, which is a natural extension to BN theory, is investigated. The results, both in terms of the AUC and the recently introduced H-measure, are rigorously tested using the statistical framework of Demsar. It is concluded that simple and comprehensible networks with less nodes can be constructed using BN classifiers other than the Naive Bayes classifier. Furthermore, it is found that the aspects of comprehensibility and predictive performance need to be balanced out, and also the development context is an item which should be taken into account during model selection.

Keywords:
Markov blanket Computer science Machine learning Artificial intelligence Naive Bayes classifier Classifier (UML) Bayesian network Software Data mining Feature selection Markov chain Markov model Support vector machine Variable-order Markov model Programming language

Metrics

189
Cited By
25.86
FWCI (Field Weighted Citation Impact)
131
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Software Engineering Research
Physical Sciences →  Computer Science →  Information Systems
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software
Software System Performance and Reliability
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.