Misclassification cost-sensitive fault prediction models

Yue Jiang; Bojan Čukić

doi:10.1145/1540438.1540466

ScienceGate Book Chapters

JOURNAL ARTICLE

Misclassification cost-sensitive fault prediction models

Yue Jiang Bojan Čukić

Year: 2009 Pages: 1-10

DOI: 10.1145/1540438.1540466

Get Full-Text PDF Get Analytical Report

Abstract

Traditionally, software fault prediction models are built by assuming a uniform misclassification cost. In other words, cost implications of misclassifying a faulty module as fault free are assumed to be the same as the cost implications of misclassifying a fault free module as faulty. In reality, these two types of misclassification costs are rarely equal. They are project-specific, reflecting the characteristics of the domain in which the program operates. In this paper, using project information from a public repository, we analyze the benefits of techniques which incorporate misclassification costs in the development of software fault prediction models. We find that cost-sensitive learning does not provide operational points which outperform cost-insensitive classifiers. However, an advantage of cost-sensitive modeling is the explicit choice of the operational threshold appropriate for the cost differential.

Keywords:

Computer science Fault (geology) Software Domain (mathematical analysis) Reliability engineering Data mining Cost estimate Differential (mechanical device) Machine learning Engineering Mathematics Systems engineering

Metrics

Cited By

6.02

FWCI (Field Weighted Citation Impact)

Refs

0.96

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Software Engineering Research

Physical Sciences → Computer Science → Information Systems

Software Reliability and Analysis Research

Physical Sciences → Computer Science → Software

Software System Performance and Reliability

Physical Sciences → Computer Science → Computer Networks and Communications

Misclassification cost-sensitive fault prediction models

Abstract

Metrics

Citation History

Topics

Related Documents

Misclassification Cost-Sensitive Software Defect Prediction

Instance-dependent misclassification cost-sensitive learning for default prediction

Heterogeneous fault prediction with cost‐sensitive domain adaptation

Cost Curve Evaluation of Fault Prediction Models

Cost-sensitive business failure prediction when misclassification costs are uncertain: A heterogeneous ensemble selection approach