JOURNAL ARTICLE

Ensemble Machine Learning Paradigms in Software Defect Prediction

Tarunim SharmaAman JatainShalini BhaskarKavita Pabreja

Year: 2023 Journal:   Procedia Computer Science Vol: 218 Pages: 199-209   Publisher: Elsevier BV

Abstract

Predicting faults in software aims to detect defects before the testing phase, allowing for better resource allocation and high-quality software development, which is a requisite for any organization. Machine learning techniques aid in the resolution of such issues and a variety of predictive models are being developed to categorize the software into defective modules and the one which is non-defective ones. Though applying these advanced machine learning techniques results in better utilization of time and other resources, there is still poor prediction as reported in many studies. This is because of several challenges that block defective software data, including redundancy, correlation, feature irrelevance, missing samples, and an imbalanced distribution between the faulty and non-faulty classes. Ensemble Machine learning has been adopted by practitioners and researchers globally to deal with such problems, and it is proven to demonstrate some improvement in defect prediction performance. In this review paper, all ensemble-based machine learning techniques developed for software defect prediction from 2018 to 2021 have been critically analyzed. The nucleus of this paper is to get a deep insight into why the various hybrid models still suffer from poor performance on the available datasets. A detailed review with a focus on multiple perspectives viz. faulty and non-defective datasets, performance evaluation criteria, and machine learning techniques have revealed certain gaps that can be addressed by developing more robust hyperparameter optimization algorithms, feature engineering, developing stacking and averaging models.

Keywords:
Computer science Machine learning Artificial intelligence Hyperparameter Ensemble learning Software Feature (linguistics) Software quality Feature engineering Categorization Data mining Deep learning Software development

Metrics

29
Cited By
17.94
FWCI (Field Weighted Citation Impact)
23
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Software Engineering Research
Physical Sciences →  Computer Science →  Information Systems
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software
Imbalanced Data Classification Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Ensemble Machine Learning Model for Software Defect Prediction

Journal:   Advances in Machine Learning & Artificial Intelligence Year: 2021 Vol: 2 (1)
BOOK-CHAPTER

Software Defect Prediction: A Machine Learning Approach with Voting Ensemble

Marcela MosqueraRemigio Hurtado

Lecture notes in networks and systems Year: 2024 Pages: 585-595
JOURNAL ARTICLE

Ensemble learning based software defect prediction

Xin DongYan LiangShoichiro MiyamotoShingo Yamaguchi

Journal:   Journal of Engineering Research Year: 2023 Vol: 11 (4)Pages: 377-391
© 2026 ScienceGate Book Chapters — All rights reserved.