Feature Selection in Machine Learning for Cross-Project Software Defect Number Prediction

Lindung Parningotan Manik

doi:10.1109/icts58770.2023.10330829

ScienceGate Book Chapters

JOURNAL ARTICLE

Feature Selection in Machine Learning for Cross-Project Software Defect Number Prediction

Lindung Parningotan Manik

Year: 2023 Vol: 13 Pages: 18-23

DOI: 10.1109/icts58770.2023.10330829

Get Full-Text PDF Get Analytical Report

Abstract

Although many studies have investigated the usage of machine learning (ML) algorithms for cross-project software defect prediction, limited works addressed cross-project defect number prediction (CPDNP). This study investigated the robustness of feature selection in five ML algorithms, such as Decision Trees, Random Forests, Gradient Boosted Trees, Generalized Linear Model, and Deep Neural Networks, for CPDNP. The results showed that Gradient Boosted Tree generated models with the lowest errors in most projects. However, models generated by Random Forests and Deep Neural Networks with feature selection were considered the most robust ones. Meanwhile, models generated by Decision Trees were the least robust. The feature selection was sufficient to generate robust ML models for CPDNP.

Keywords:

Random forest Feature selection Artificial intelligence Computer science Machine learning Robustness (evolution) Decision tree Artificial neural network Feature (linguistics) Feature engineering Software Selection (genetic algorithm) Deep neural networks Deep learning Data mining

Metrics

Cited By

1.24

FWCI (Field Weighted Citation Impact)

Refs

0.82

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Software Engineering Research

Physical Sciences → Computer Science → Information Systems

Software Reliability and Analysis Research

Physical Sciences → Computer Science → Software

Software Engineering Techniques and Practices

Physical Sciences → Computer Science → Information Systems

Feature Selection in Machine Learning for Cross-Project Software Defect Number Prediction

Abstract

Metrics

Citation History

Topics

Related Documents

Feature Selection in Cross-Project Software Defect Prediction

Cross-Project Software Defect Prediction Based on Feature Selection and Transfer Learning

Cross-Project Software Defect Prediction Using Feature-Based Transfer Learning

Graph-Based Feature Learning for Cross-Project Software Defect Prediction

Cross-Project Software Defect Prediction Based on Feature Selection and Knowledge Distillation