JOURNAL ARTICLE

Dictionary learning based software defect prediction

Abstract

In order to improve the quality of a software system, software defect prediction aims to automatically identify defective software modules for efficient software test. To predict software defect, those classification methods with static code attributes have attracted a great deal of attention. In recent years, machine learning techniques have been applied to defect prediction. Due to the fact that there exists the similarity among different software modules, one software module can be approximately represented by a small proportion of other modules. And the representation coefficients over the pre-defined dictionary, which consists of historical software module data, are generally sparse. In this paper, we propose to use the dictionary learning technique to predict software defect. By using the characteristics of the metrics mined from the open source software, we learn multiple dictionaries (including defective module and defective-free module sub-dictionaries and the total dictionary) and sparse representation coefficients. Moreover, we take the misclassification cost issue into account because the misclassification of defective modules generally incurs much higher risk cost than that of defective-free ones. We thus propose a cost-sensitive discriminative dictionary learning (CDDL) approach for software defect classification and prediction. The widely used datasets from NASA projects are employed as test data to evaluate the performance of all compared methods. Experimental results show that CDDL outperforms several representative state-of-the-art defect prediction methods.

Keywords:
Computer science Discriminative model Software Software bug Artificial intelligence Software metric Machine learning Software quality Data mining Software development Pattern recognition (psychology) Programming language

Metrics

220
Cited By
36.31
FWCI (Field Weighted Citation Impact)
63
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Software Engineering Research
Physical Sciences →  Computer Science →  Information Systems
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software
Software System Performance and Reliability
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Software Defect Prediction Using Dictionary Learning

Hongyan WanGuoqing WuMing ChengQing HuangRui WangMengting Yuan

Journal:   Proceedings/Proceedings of the ... International Conference on Software Engineering and Knowledge Engineering Year: 2017 Vol: 2017 Pages: 335-340
JOURNAL ARTICLE

Software Defect Prediction Based on Cost-Sensitive Dictionary Learning

Hongyan WanGuoqing WuMali YuMengting Yuan

Journal:   International Journal of Software Engineering and Knowledge Engineering Year: 2019 Vol: 29 (09)Pages: 1219-1243
JOURNAL ARTICLE

Cost-sensitive Dictionary Learning for Software Defect Prediction

Liang NiuJianwu WanHongyuan WangKaiwei Zhou

Journal:   Neural Processing Letters Year: 2020 Vol: 52 (3)Pages: 2415-2449
JOURNAL ARTICLE

Learning based Software Defect Prediction

Divya Kumari Tankala

Journal:   International Journal of Computational Intelligence Research Year: 2023 Vol: 19 (1)Pages: 51-62
JOURNAL ARTICLE

Semi‐supervised Software Defect Prediction Using Task‐Driven Dictionary Learning

Ming ChengGuoqing WuMengting YuanHongyan Wan

Journal:   Chinese Journal of Electronics Year: 2016 Vol: 25 (6)Pages: 1089-1096
© 2026 ScienceGate Book Chapters — All rights reserved.