A Top-k Learning to Rank Approach to Cross-Project Software Defect Prediction

Feng Wang; Jinxiao Huang; Yutao Ma

doi:10.1109/apsec.2018.00048

ScienceGate Book Chapters

JOURNAL ARTICLE

A Top-k Learning to Rank Approach to Cross-Project Software Defect Prediction

Feng Wang Jinxiao Huang Yutao Ma

Year: 2018 Pages: 335-344

DOI: 10.1109/apsec.2018.00048

Get Full-Text PDF Get Analytical Report

Abstract

Cross-project defect prediction (CPDP) has recently attracted increasing attention in the field of Software Engineering. Most of the previous studies, which treated it as a binary classification problem or a regression problem, are not practical for software testing activities. To provide developers with a more valuable ranking of the most severe entities (e.g., classes and modules), in this paper, we propose a top-k learning to rank (LTR) approach in the scenario of CPDP. In particular, we first convert the number of defects into graded relevance to a specific query according to the three-sigma rule; then, we put forward a new data resampling method called SMOTE-PENN to tackle the imbalanced data problem. An empirical study on the PROMISE dataset shows that SMOTE-PENN outperforms the other six competitive resampling algorithms and RankNet performs the best for the proposed approach framework. Thus, our work could lay a foundation for efficient search engines for top-ranked defective entities in real software testing activities without local historical data for a target project.

Keywords:

Computer science Relevance (law) Ranking (information retrieval) Learning to rank Resampling Machine learning Data mining Software bug Field (mathematics) Software Rank (graph theory) Artificial intelligence Binary classification Binary number Support vector machine Programming language

Metrics

Cited By

4.91

FWCI (Field Weighted Citation Impact)

Refs

0.95

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Software Engineering Research

Physical Sciences → Computer Science → Information Systems

Software Reliability and Analysis Research

Physical Sciences → Computer Science → Software

Software Testing and Debugging Techniques

Physical Sciences → Computer Science → Software

A Top-k Learning to Rank Approach to Cross-Project Software Defect Prediction

Abstract

Metrics

Citation History

Topics

Related Documents

Software defect prediction using learning to rank approach

Using active learning selection approach for cross-project software defect prediction

A Learning-to-Rank Approach to Software Defect Prediction

Cross-project software defect prediction through multiple learning

Manifold Learning for Cross-project Software Defect Prediction