JOURNAL ARTICLE

Graph-Based Feature Learning for Cross-Project Software Defect Prediction

Ahmed AbduZhengjun ZhaiHakim A. AbdoRedhwan AlgabriSungon Lee

Year: 2023 Journal:   Computers, materials & continua/Computers, materials & continua (Print) Vol: 77 (1)Pages: 161-180

Abstract

Cross-project software defect prediction (CPDP) aims to enhance defect prediction in target projects with limited or no historical data by leveraging information from related source projects.The existing CPDP approaches rely on static metrics or dynamic syntactic features, which have shown limited effectiveness in CPDP due to their inability to capture higher-level system properties, such as complex design patterns, relationships between multiple functions, and dependencies in different software projects, that are important for CPDP.This paper introduces a novel approach, a graph-based feature learning model for CPDP (GB-CPDP), that utilizes NetworkX to extract features and learn representations of program entities from control flow graphs (CFGs) and data dependency graphs (DDGs).These graphs capture the structural and data dependencies within the source code.The proposed approach employs Node2Vec to transform CFGs and DDGs into numerical vectors and leverages Long Short-Term Memory (LSTM) networks to learn predictive models.The process involves graph construction, feature learning through graph embedding and LSTM, and defect prediction.Experimental evaluation using nine open-source Java projects from the PROMISE dataset demonstrates that GB-CPDP outperforms state-of-the-art CPDP methods in terms of F1-measure and Area Under the Curve (AUC).The results showcase the effectiveness of GB-CPDP in improving the performance of cross-project defect prediction.

Keywords:
Computer science Graph Feature (linguistics) Software Artificial intelligence Machine learning Theoretical computer science Programming language

Metrics

5
Cited By
3.09
FWCI (Field Weighted Citation Impact)
39
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Software Engineering Research
Physical Sciences →  Computer Science →  Information Systems
Software Engineering Techniques and Practices
Physical Sciences →  Computer Science →  Information Systems
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.