JOURNAL ARTICLE

A Top-k Learning to Rank Approach to Cross-Project Software Defect Prediction

Abstract

Cross-project defect prediction (CPDP) has recently attracted increasing attention in the field of Software Engineering. Most of the previous studies, which treated it as a binary classification problem or a regression problem, are not practical for software testing activities. To provide developers with a more valuable ranking of the most severe entities (e.g., classes and modules), in this paper, we propose a top-k learning to rank (LTR) approach in the scenario of CPDP. In particular, we first convert the number of defects into graded relevance to a specific query according to the three-sigma rule; then, we put forward a new data resampling method called SMOTE-PENN to tackle the imbalanced data problem. An empirical study on the PROMISE dataset shows that SMOTE-PENN outperforms the other six competitive resampling algorithms and RankNet performs the best for the proposed approach framework. Thus, our work could lay a foundation for efficient search engines for top-ranked defective entities in real software testing activities without local historical data for a target project.

Keywords:
Computer science Relevance (law) Ranking (information retrieval) Learning to rank Resampling Machine learning Data mining Software bug Field (mathematics) Software Rank (graph theory) Artificial intelligence Binary classification Binary number Support vector machine Programming language

Metrics

21
Cited By
4.91
FWCI (Field Weighted Citation Impact)
55
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Software Engineering Research
Physical Sciences →  Computer Science →  Information Systems
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software
Software Testing and Debugging Techniques
Physical Sciences →  Computer Science →  Software

Related Documents

JOURNAL ARTICLE

Using active learning selection approach for cross-project software defect prediction

Wenbo MiYong LiMing WenYouren Chen

Journal:   Connection Science Year: 2022 Vol: 34 (1)Pages: 1482-1499
JOURNAL ARTICLE

A Learning-to-Rank Approach to Software Defect Prediction

Xiaoxing YangKe TangXin Yao

Journal:   IEEE Transactions on Reliability Year: 2014 Vol: 64 (1)Pages: 234-246
JOURNAL ARTICLE

Cross-project software defect prediction through multiple learning

Yahaya Zakariyau BalaPathiah Abdul SamatKhaironi Yatim SharifNoridayu Manshor

Journal:   Bulletin of Electrical Engineering and Informatics Year: 2024 Vol: 13 (3)Pages: 2027-2035
© 2026 ScienceGate Book Chapters — All rights reserved.