JOURNAL ARTICLE

Low‐rank representation for semi‐supervised software defect prediction

Zhiwu ZhangXiao‐Yuan JingFei Wu

Year: 2018 Journal:   IET Software Vol: 12 (6)Pages: 527-535   Publisher: Institution of Engineering and Technology

Abstract

Software defect prediction based on machine learning is an active research topic in the field of software engineering. The historical defect data in software repositories may contain noises because automatic defect collection is based on modified logs and defect reports. When the previous defect labels of modules are limited, predicting the defect‐prone modules becomes a challenging problem. In this study, the authors propose a graph‐based semi‐supervised defect prediction approach to solve the problems of insufficient labelled data and noisy data. Graph‐based semi‐supervised learning methods used the labelled and unlabelled data simultaneously and consider them as the nodes of the graph at the training phase. Therefore, they solve the problem of insufficient labelled samples. To improve the stability of noisy defect data, a powerful clustering method, low‐rank representation (LRR), and neighbourhood distance are used to construct the relationship graph of samples. Therefore, they propose a new semi‐supervised defect prediction approach, named low‐rank representation‐based semi‐supervised software defect prediction (LRRSSDP). The widely used datasets from NASA projects and noisy datasets are employed as test data to evaluate the performance. Experimental results show that (i) LRRSSDP outperforms several representative state‐of‐the‐art semi‐supervised defect prediction methods; and (ii) LRRSSDP can maintain robustness in noisy environments.

Keywords:
Computer science Robustness (evolution) Software Artificial intelligence Graph Machine learning Data mining Software bug External Data Representation Test data Noisy data Cluster analysis Pattern recognition (psychology) Theoretical computer science Software engineering

Metrics

12
Cited By
0.89
FWCI (Field Weighted Citation Impact)
44
Refs
0.80
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Software Engineering Research
Physical Sciences →  Computer Science →  Information Systems
Software Reliability and Analysis Research
Physical Sciences →  Computer Science →  Software
Software System Performance and Reliability
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Semi-supervised low-rank representation for image classification

Chenxue YangMao YeSong TangTao XiangZijian Liu

Journal:   Signal Image and Video Processing Year: 2016 Vol: 11 (1)Pages: 73-80
JOURNAL ARTICLE

Semi-Supervised Classification Based on Low Rank Representation

Xuan HouGuangjun YaoJun Wang

Journal:   Algorithms Year: 2016 Vol: 9 (3)Pages: 48-48
JOURNAL ARTICLE

An Integrated Semi-supervised Software Defect Prediction Model

Fanqi Meng Fanqi MengWenying Cheng Fanqi MengJingdong Wang Wenying Cheng

Journal:   網際網路技術學刊 Year: 2023 Vol: 24 (6)Pages: 1307-1317
© 2026 ScienceGate Book Chapters — All rights reserved.