Evaluating Precision and Recall at Retrieval Time in Retrieval-Augmented Generation (RAG) Systems

Gopichand Agnihotram; Joydeep Sarkar

doi:10.11648/j.ajcst.20250804.11

ScienceGate Book Chapters

JOURNAL ARTICLE

Evaluating Precision and Recall at Retrieval Time in Retrieval-Augmented Generation (RAG) Systems

Gopichand Agnihotram Joydeep Sarkar

Year: 2025 Journal: American Journal of Computer Science and Technology Vol: 8 (4)Pages: 174-180 Publisher: Science Publishing Group

DOI: 10.11648/j.ajcst.20250804.11

Get Full-Text PDF Get Analytical Report

Abstract

Retrieval-Augmented Generation (RAG) systems signify a pivotal advancement in natural language processing, merging information retrieval with large language models (LLMs) to ground responses in external knowledge. This hybrid approach enhances the factual accuracy and currency of generated content, mitigating common issues like hallucination. The efficacy of a RAG system, however, is fundamentally dependent on the performance of its retrieval component. This paper provides a detailed analysis of precision and recall as critical metrics for evaluating and optimizing this retrieval step. We explore the distinct roles and inherent trade-offs of these metrics within a RAG pipeline, demonstrating their direct influence on the quality of the final output. Through a series of experiments comparing sparse (BM25), dense (DPR), and hybrid retrieval methods, we quantify their performance characteristics. The analysis is further enriched with real-world examples from finance, law, and healthcare, illustrating the practical implications of retrieval quality. Additionally, we outline advanced strategies for improving retrieval effectiveness, such as multi-stage architecture involving rerankers and the use of query transformations. The paper concludes with a set of best practices for deploying robust, enterprise-grade RAG systems, emphasizing the need for continuous evaluation and sophisticated retrieval strategies. By focusing on the systematic optimization of precision and recall, organizations can build more reliable and trustworthy AI applications.

Keywords:

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Evaluating Precision and Recall at Retrieval Time in Retrieval-Augmented Generation (RAG) Systems

Abstract

Metrics

Topics

Related Documents

Building Retrieval-Augmented Generation (RAG) Systems

Retrieval-Augmented Generation (RAG)

Retrieval Augmented Generation (RAG) Model

Retrieval Augmented Generation (RAG) Model

Retrieval-Augmented Generation (RAG) Chatbots