Muhammad Nabil Muyassar RahmanRadityo Adi NugrohoMohammad Reza FaisalFriska AbadiRudy Herteno
<p>In software defect prediction, noisy attributes and high-dimensional data <br>remain to be a critical challenge. This paper introduces a novel approach <br>known as multi correlation-based feature selection (MCFS), which seeks to <br>address these challenges. MCFS integrates two feature selection techniques, <br>namely correlation-based feature selection (CFS) and correlation matrixbased feature selection (CMFS), intending to reduce data dimensionality and <br>eliminate noisy attributes. To accomplish this, CFS and CMFS are applied <br>independently to filter the datasets, and a weighted average of their <br>outcomes is computed to determine the optimal feature selection. This <br>approach not only reduces data dimensionality but also mitigates the impact <br>of noisy attributes. To further enhance predictive performance, this paper <br>leverages the particle swarm optimization (PSO) algorithm as a feature <br>selection mechanism, specifically targeting improvements in the area under <br>the curve (AUC). The evaluation of the proposed method is conducted on 12 <br>benchmark datasets sourced from the NASA metrics data program (MDP)<br>corpus, renowned for their noisy attributes, high dimensionality, and <br>imbalanced class records. The research findings demonstrate that MCFS <br>outperforms CFS and CMFS, yielding an average AUC value of 0.891, <br>thereby emphasizing it is efficacy in advancing classification performance in <br>the context of software defect prediction using k-nearest neighbors (KNN) <br>classification.</p>
Radityo Adi NugrohoFriska AbadiMuhammad FaisalRudy HertenoRahmat Ramadhani
Abdullateef Oluwagbemiga BalogunShuib BasriSaid Jadid AbdulkadirSaipunidzam MahamadMalek A. Al-momamniAbdullahi Abubakar ImamGanesh Kumar
J. NandithaK SruthiSreeja AshokM. V. Judy
Mohamed M. IbrahimYasmin Makki MohialdenDoaa Mohsin Abd Ali Afraji