Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey
By: Aoran Gan, Hao Yu, Kai Zhang, and more
Potential Business Impact:
Tests how AI uses outside facts to answer questions.
Recent advancements in Retrieval-Augmented Generation (RAG) have revolutionized natural language processing by integrating Large Language Models (LLMs) with external information retrieval, enabling accurate, up-to-date, and verifiable text generation across diverse applications. However, evaluating RAG systems presents unique challenges due to their hybrid architecture, which combines retrieval and generation components, and their dependence on dynamic knowledge sources in the LLM era. In response, this paper provides a comprehensive survey of RAG evaluation methods and frameworks, systematically reviewing traditional and emerging evaluation approaches for system performance, factual accuracy, safety, and computational efficiency in the LLM era. We also compile and categorize RAG-specific datasets and evaluation frameworks, conducting a meta-analysis of evaluation practices in high-impact RAG research. To the best of our knowledge, this work represents the most comprehensive survey of RAG evaluation, bridging traditional and LLM-driven methods, and serves as a critical resource for advancing RAG development.
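To make the component-wise evaluation the abstract describes concrete, here is a minimal sketch of a RAG evaluation harness that scores the retrieval and generation stages separately. All names here (EvalItem, retrieve, generate, the stub data) are hypothetical illustrations of the retrieval/generation split, not the survey's own framework or metrics.

```python
# Minimal sketch of component-wise RAG evaluation (hypothetical API).
# The retriever and generator are assumed to be callables supplied by
# the system under test; the metrics below are two common baselines:
# top-k retrieval hit rate and normalized exact-match answer accuracy.

from dataclasses import dataclass

@dataclass
class EvalItem:
    question: str
    gold_doc_id: str   # document the retriever should surface
    gold_answer: str   # reference answer for the generator

def retrieval_hit_rate(items, retrieve, k=5):
    """Fraction of questions whose gold document appears in the top-k.

    `retrieve(question, k)` is assumed to return a ranked list of
    (doc_id, score) pairs.
    """
    hits = sum(
        item.gold_doc_id in [doc_id for doc_id, _ in retrieve(item.question, k)]
        for item in items
    )
    return hits / len(items)

def answer_exact_match(items, generate):
    """Fraction of generated answers matching the reference after
    lowercasing and whitespace normalization."""
    norm = lambda s: " ".join(s.lower().split())
    return sum(
        norm(generate(item.question)) == norm(item.gold_answer)
        for item in items
    ) / len(items)

# Toy usage with stubs standing in for a real retriever and generator.
retrieve = lambda q, k: [("d1", 1.0)]   # stub retriever
generate = lambda q: "Paris"            # stub generator
items = [EvalItem("Capital of France?", "d1", "Paris")]
print(retrieval_hit_rate(items, retrieve))   # 1.0
print(answer_exact_match(items, generate))   # 1.0
```

Scoring the two stages separately, as above, is what lets an evaluation distinguish retrieval failures (gold document never surfaced) from generation failures (right context, wrong answer); the LLM-driven methods surveyed typically replace exact match with learned or model-judged measures of faithfulness.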
Similar Papers
When Retrieval Succeeds and Fails: Rethinking Retrieval-Augmented Generation for LLMs
Computation and Language
Shows when looking up outside facts helps or hurts AI answers.
Knowledge-Graph Based RAG System Evaluation Framework
Computation and Language
Checks RAG systems by testing them against knowledge graphs.
Retrieval-Augmented Generation: A Comprehensive Survey of Architectures, Enhancements, and Robustness Frontiers
Information Retrieval
Reviews how RAG systems are built, improved, and made more robust.