JOURNAL ARTICLE

Software Defect Prediction Using Random Forest Algorithm

Abstract

The software defect can cause the unnecessary effects on the software such as cost and quality. The prediction of the software defect can be useful for the development of good quality software. For the prediction, the PROMISE public dataset will be used and random forest (RF) algorithm will be applied with the RAPIDMINER machine learning tool. This paper will compare the performance evaluation upon the different number of trees in RF. As the results, the accuracy will be slightly increased if the number of trees will be more. The maximum accuracy is up to 99.59 and the minimum accuracy is 85.96. Another comparison is based on AUC curve that represents the most informative indicator of predictive accuracy within the field of software defect prediction. All of the results show that RF algorithm is effective in this prediction which is more suitable with the usage of hundred trees in the RF.

Keywords:
Random forest Computer science Software Software quality Software bug Machine learning Quality (philosophy) Artificial intelligence Algorithm Software metric Predictive modelling Field (mathematics) Data mining Software development Mathematics

Metrics

35
Cited By
2.23
FWCI (Field Weighted Citation Impact)
4
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Software Engineering Research
Physical Sciences →  Computer Science →  Information Systems
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software
Software System Performance and Reliability
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.