RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query Parallelism
By: Zhiwen Tan , Jiaming Huang , Qintong Wu and more
Potential Business Impact:
Helps computers answer questions with newer facts.
Large Language Models (LLMs) have demonstrated remarkable capabilities across various tasks, while LLMs remain prone to generating hallucinated or outdated responses due to their static internal knowledge. Recent advancements in Retrieval-Augmented Generation (RAG) methods have aimed to enhance models' search and reasoning capabilities through reinforcement learning (RL). Although these methods demonstrate promising results, they face challenges in training stability and encounter issues such as substantial inference time and restricted capabilities due to reliance on single-query mode. In this paper, we propose RAG-R1, a novel training framework designed to enable LLMs to adaptively leverage internal and external knowledge during the reasoning process. We further expand the generation and retrieval processes within the framework from single-query mode to multi-query parallelism, with the aim of reducing inference time and enhancing the model's capabilities. Extensive experiments on seven question-answering benchmarks demonstrate that our method outperforms the strongest baseline by up to 13.2% and decreases inference time by 11.1%.
Similar Papers
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning
Computation and Language
Helps AI use its brain and search for answers.
Insight-RAG: Enhancing LLMs with Insight-Driven Augmentation
Computation and Language
Helps computers find better answers from many texts.
MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval
Computation and Language
Lets computers find better answers from many sources.