Score: 2

FrugalRAG: Learning to retrieve and reason for multi-hop QA

Published: July 10, 2025 | arXiv ID: 2507.07634v2

By: Abhinav Java , Srivathsan Koundinyan , Nagarajan Natarajan and more

BigTech Affiliations: Microsoft

Potential Business Impact:

Answers questions using fewer searches.

Business Areas:

Natural Language Processing Artificial Intelligence, Data and Analytics, Software

We consider the problem of answering complex questions, given access to a large unstructured document corpus. The de facto approach to solving the problem is to leverage language models that (iteratively) retrieve and reason through the retrieved documents, until the model has sufficient information to generate an answer. Attempts at improving this approach focus on retrieval-augmented generation (RAG) metrics such as accuracy and recall and can be categorized into two types: (a) fine-tuning on large question answering (QA) datasets augmented with chain-of-thought traces, and (b) leveraging RL-based fine-tuning techniques that rely on question-document relevance signals. However, efficiency in the number of retrieval searches is an equally important metric, which has received less attention. In this work, we show that: (1) Large-scale fine-tuning is not needed to improve RAG metrics, contrary to popular claims in recent literature. Specifically, a standard ReAct pipeline with improved prompts can outperform state-of-the-art methods on benchmarks such as HotPotQA. (2) Supervised and RL-based fine-tuning can help RAG from the perspective of frugality, i.e., the latency due to number of searches at inference time. For example, we show that we can achieve competitive RAG metrics at nearly half the cost (in terms of number of searches) on popular RAG benchmarks, using the same base model, and at a small training cost (1000 examples).

Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems

Information Retrieval

Helps computers answer money questions better.

19 Mar 2025 2

93%

Investigating the Robustness of Retrieval-Augmented Generation at the Query Level

Computation and Language

Makes AI smarter by improving how it finds answers.

9 Jul 2025 2

93%

Transforming Questions and Documents for Semantically Aligned Retrieval-Augmented Generation

Computation and Language

Answers hard questions by breaking them down.

13 Aug 2025 0

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

15 pages

FrugalRAG: Learning to retrieve and reason for multi-hop QA

Answers questions using fewer searches.

Technical Abstract

Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems

Investigating the Robustness of Retrieval-Augmented Generation at the Query Level

Transforming Questions and Documents for Semantically Aligned Retrieval-Augmented Generation