JOURNAL ARTICLE

Can Information Retrieval Techniques Effectively Support Traceability Link Recovery?

Abstract

Applying information retrieval (IR) techniques to retrieve all correct links between software artefacts is in general impractical, as usually this means producing a high effort for discarding too many false positives. We show that the only way to recover traceability links using IR methods is to identify an "optimal" threshold that achieves an acceptable balance between traced links and false positives. Unfortunately, such threshold is not known a priori. For this reason we have devised the need to use an incremental traceability recovery approach to gradually identify the threshold where it is more convenient to stop the traceability recovery process, and provide evidence of this in a case study. We also report the experience of using the incremental traceability recovery during the development of software projects

Keywords:
Traceability Computer science False positive paradox A priori and a posteriori Data mining Process (computing) Software Information retrieval Artificial intelligence Software engineering Programming language

Metrics

40
Cited By
9.84
FWCI (Field Weighted Citation Impact)
19
Refs
0.98
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Software Engineering Research
Physical Sciences →  Computer Science →  Information Systems
Open Source Software Innovations
Physical Sciences →  Computer Science →  Computer Science Applications
Software Engineering Techniques and Practices
Physical Sciences →  Computer Science →  Information Systems
© 2026 ScienceGate Book Chapters — All rights reserved.