JOURNAL ARTICLE

Understanding the automated parameter optimization on transfer learning for cross-project defect prediction

Abstract

Data-driven defect prediction has become increasingly important in software engineering process. Since it is not uncommon that data from a software project is insufficient for training a reliable defect prediction model, transfer learning that borrows data/konwledge from other projects to facilitate the model building at the current project, namely cross-project defect prediction (CPDP), is naturally plausible. Most CPDP techniques involve two major steps, i.e., transfer learning and classification, each of which has at least one parameter to be tuned to achieve their optimal performance. This practice fits well with the purpose of automated parameter optimization. However, there is a lack of thorough understanding about what are the impacts of automated parameter optimization on various CPDP techniques. In this paper, we present the first empirical study that looks into such impacts on 62 CPDP techniques, 13 of which are chosen from the existing CPDP literature while the other 49 ones have not been explored before. We build defect prediction models over 20 real-world software projects that are of different scales and characteristics. Our findings demonstrate that: (1) Automated parameter optimization substantially improves the defect prediction performance of 77% CPDP techniques with a manageable computational cost. Thus more efforts on this aspect are required in future CPDP studies. (2) Transfer learning is of ultimate importance in CPDP. Given a tight computational budget, it is more cost-effective to focus on optimizing the parameter configuration of transfer learning algorithms (3) The research on CPDP is far from mature where it is ‘not difficult’ to find a better alternative by making a combination of existing transfer learning and classification techniques. This finding provides important insights about the future design of CPDP techniques.

Keywords:
Computer science Machine learning Transfer of learning Artificial intelligence Software Process (computing) Empirical research Data mining Focus (optics) Mathematics

Metrics

52
Cited By
8.78
FWCI (Field Weighted Citation Impact)
74
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Software Engineering Research
Physical Sciences →  Computer Science →  Information Systems
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software
Software Engineering Techniques and Practices
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Source selection and transfer defect learning based cross-project defect prediction

Wanzhi WenNingbo ZhuBingqing YeXikai LiChuyue WangJiawei ChuYuehua Li

Journal:   International Journal of Computing Science and Mathematics Year: 2022 Vol: 16 (3)Pages: 195-195
JOURNAL ARTICLE

Source selection and transfer defect learning based cross-project defect prediction

Chuyue WangJiawei ChuYuehua LiXikai LiWanzhi WenNingbo ZhuBingqing Ye

Journal:   International Journal of Computing Science and Mathematics Year: 2022 Vol: 16 (3)Pages: 195-195
JOURNAL ARTICLE

Impact of Hyper Parameter Optimization for Cross-Project Software Defect Prediction

Yubin Qu

Journal:   International Journal of Performability Engineering Year: 2018
JOURNAL ARTICLE

A Survey on Transfer Learning for Cross-Project Defect Prediction

Bruno Sotto-MayorMeir Kalech

Journal:   IEEE Access Year: 2024 Vol: 12 Pages: 93398-93425
© 2026 ScienceGate Book Chapters — All rights reserved.