RAG Without the Lag: Interactive Debugging for Retrieval-Augmented Generation Pipelines
By: Quentin Romero Lauro, Shreya Shankar, Sepanta Zeighami, and more
Potential Business Impact:
Helps developers quickly find and fix why an AI assistant gives wrong answers.
Retrieval-augmented generation (RAG) pipelines have become the de facto approach for building AI assistants with access to external, domain-specific knowledge. Given a user query, RAG pipelines typically first retrieve (R) relevant information from external sources before invoking a Large Language Model (LLM), augmented (A) with this information, to generate (G) responses. Modern RAG pipelines frequently chain multiple retrieval and generation components in any order. However, developing effective RAG pipelines is challenging because retrieval and generation components are intertwined, making it hard to identify which component(s) cause errors in the eventual output. Moreover, the parameters with the greatest impact on output quality often require hours of pre-processing after each change, creating prohibitively slow feedback cycles. To address these challenges, we present RAGGY, a developer tool that combines a Python library of composable RAG primitives with an interactive interface for real-time debugging. We contribute the design and implementation of RAGGY, insights into expert debugging patterns through a qualitative study with 12 engineers, and design implications for future RAG tools that better align with developers' natural workflows.
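To make the retrieve-augment-generate flow above concrete, the following is a minimal, self-contained Python sketch of a single-hop RAG pipeline. It is illustrative only and does not reflect RAGGY's actual API: the document list, the keyword-overlap retriever, and the placeholder generate() function are hypothetical stand-ins for a real embedding index and an LLM call.

    # Minimal sketch of a retrieve-augment-generate loop (assumptions noted above;
    # not RAGGY's API). Helper names and the toy scorer are hypothetical.

    DOCUMENTS = [
        "RAG pipelines retrieve external documents before generation.",
        "Chunk size and embedding choices strongly affect retrieval quality.",
        "Re-indexing the corpus after every parameter change is slow.",
    ]

    def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
        # Toy retriever: rank documents by word overlap with the query.
        q = set(query.lower().split())
        scored = sorted(docs, key=lambda d: -len(q & set(d.lower().split())))
        return scored[:k]

    def augment(query: str, context: list[str]) -> str:
        # Splice the retrieved passages into the prompt sent to the LLM.
        return "Context:\n" + "\n".join(context) + f"\n\nQuestion: {query}"

    def generate(prompt: str) -> str:
        # Placeholder for an LLM call via whatever client you already use.
        return f"[LLM response to a prompt of {len(prompt)} characters]"

    query = "Why is changing chunk size expensive in RAG pipelines?"
    print(generate(augment(query, retrieve(query, DOCUMENTS))))

In a real pipeline, retrieve() would query a pre-built embedding index over chunked documents; rebuilding that index after a parameter change is exactly the slow pre-processing step the abstract identifies as the bottleneck in the debugging feedback loop.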
Similar Papers
Constructing and Evaluating Declarative RAG Pipelines in PyTerrier
Information Retrieval
Builds better search answers from many documents.
Investigating the Robustness of Retrieval-Augmented Generation at the Query Level
Computation and Language
Makes AI smarter by improving how it finds answers.
Never Come Up Empty: Adaptive HyDE Retrieval for Improving LLM Developer Support
Software Engineering
Makes computer helpers give better, true answers.