An Information Flow-based Feature Selection Method for Cross-Project Defect Prediction

Yaning Wu

doi:10.23940/ijpe.18.06.p17.12631274

JOURNAL ARTICLE

Year: 2018; Journal: International Journal of Performability Engineering; Publisher: Totem Publisher

Abstract

Software defect prediction (SDP) plays a significant part in identifying the most defect-prone modules before software testing and in allocating limited testing resources. One of the most commonly used formulations of SDP is classification. To ensure prediction accuracy, the classification models must first be trained appropriately. The training data can be obtained from historical software repositories, and its quality may affect classification performance to a large extent. To improve data quality, we propose a novel software feature selection method that uses information flows to perform causality analysis on the features of training datasets. More specifically, we conduct causality analysis between each feature metric and the label metric bug; then, based on the resulting feature ranking list, we select the top-k features to control redundancy. Finally, we choose the most suitable feature subset based on the F-measure. To demonstrate the effectiveness and practicality of the feature selection method, we use the Nearest Neighbor approach to construct a homogeneous training dataset and run comparison experiments with three commonly used classification models. The experimental results support the applicability and validity of the feature selection method.
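The rank, top-k, F-measure pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several parts are assumptions made for illustration. The abstract does not give the information-flow formula, so the scoring function here is absolute Pearson correlation, a plainly different stand-in that can be replaced through the `score` parameter. A toy 1-nearest-neighbour classifier stands in for the three classification models. All function names are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Matrix = Sequence[Sequence[float]]
Score = Callable[[Sequence[float], Sequence[float]], float]


def abs_correlation(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Placeholder relevance score: |Pearson correlation|.

    ASSUMPTION: this is NOT the paper's information-flow measure; it only
    fills the slot where a feature-to-label causality score would go.
    """
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0  # a constant column carries no signal
    return abs(cov) / (vx * vy) ** 0.5


def rank_features(X: Matrix, y: Sequence[int],
                  score: Score = abs_correlation) -> List[int]:
    """Return feature indices sorted by descending score against the label."""
    scores = [score([row[j] for row in X], y) for j in range(len(X[0]))]
    return sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)


def f_measure(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F-measure (harmonic mean of precision and recall) for the class 1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision, recall = tp / (tp + fp), tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def predict_1nn(train_X: Matrix, train_y: Sequence[int],
                test_X: Matrix, cols: Sequence[int]) -> List[int]:
    """Toy 1-nearest-neighbour classifier restricted to the columns in cols."""
    preds = []
    for row in test_X:
        nearest = min(
            range(len(train_X)),
            key=lambda i: sum((train_X[i][c] - row[c]) ** 2 for c in cols),
        )
        preds.append(train_y[nearest])
    return preds


def select_features(train_X: Matrix, train_y: Sequence[int],
                    valid_X: Matrix, valid_y: Sequence[int],
                    score: Score = abs_correlation) -> Tuple[List[int], float]:
    """Try every top-k prefix of the ranking; keep the best by F-measure."""
    ranking = rank_features(train_X, train_y, score)
    best_subset: List[int] = []
    best_f = -1.0
    for k in range(1, len(ranking) + 1):
        subset = ranking[:k]
        preds = predict_1nn(train_X, train_y, valid_X, subset)
        f = f_measure(valid_y, preds)
        if f > best_f:  # strict '>' keeps the smallest k on ties
            best_subset, best_f = subset, f
    return best_subset, best_f
```

Passing the scoring function as a parameter keeps the ranking and subset search independent of the causality measure, so an information-flow estimator could be supplied without changing the rest of the sketch.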

Keywords:

Feature selection; Computer science; Data mining; Selection (genetic algorithm); Information flow; Flow (mathematics); Feature (linguistics); Artificial intelligence; Machine learning; Reliability engineering; Pattern recognition (psychology); Engineering; Mathematics

Topics

Manufacturing Process and Optimization

Physical Sciences → Engineering → Industrial and Manufacturing Engineering

Industrial Vision Systems and Defect Detection

Physical Sciences → Engineering → Industrial and Manufacturing Engineering

BIM and Construction Integration

Physical Sciences → Engineering → Building and Construction

Related Documents

Cross‐project defect prediction method based on genetic algorithm feature selection

A Cluster Based Feature Selection Method for Cross-Project Software Defect Prediction

Feature Selection in Cross-Project Software Defect Prediction

Candidate project selection method combined with feature filtering in cross project defect prediction

Cross-Project Defect Prediction Method based on Feature Distribution Alignment and Neighborhood Instance Selection