Query Decomposition for RAG: Balancing Exploration-Exploitation
By: Roxana Petcu, Kenton Murray, Daniel Khashabi and more
Potential Business Impact:
Helps computers find better answers by asking smarter questions.
Retrieval-augmented generation (RAG) systems address complex user requests by decomposing them into sub-queries, retrieving potentially relevant documents for each, and then aggregating them to generate an answer. Efficiently selecting informative documents requires balancing a key trade-off: (i) retrieving broadly enough to capture all the relevant material, and (ii) limiting retrieval to avoid excessive noise and computational cost. We formulate query decomposition and document retrieval in an exploitation-exploration setting, where retrieving one document at a time builds a belief about the utility of a given sub-query and informs the decision to keep exploiting that sub-query or to explore an alternative. We experiment with a variety of bandit learning methods and demonstrate their effectiveness in dynamically selecting the most informative sub-queries. Our main finding is that estimating document relevance using rank information and human judgments yields a 35% gain in document-level precision, a 15% increase in α-nDCG, and better performance on the downstream task of long-form generation.
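To make the exploitation-exploration framing concrete, here is a minimal sketch that treats each sub-query as a bandit arm and retrieves one document at a time. It uses Thompson sampling with Beta posteriors purely as an illustration; the abstract does not say which bandit methods the authors evaluate, so this should not be read as their algorithm. The function name `thompson_subquery_retrieval`, the `is_relevant` callback (a stand-in for whatever relevance estimate is available), and the toy data are all assumptions made for this example.

```python
import random


def thompson_subquery_retrieval(ranked_docs, is_relevant, budget, seed=0):
    """Pick documents one at a time, treating each sub-query as a bandit arm.

    ranked_docs: dict mapping sub-query -> list of doc ids in rank order.
    is_relevant: callable doc_id -> bool (hypothetical relevance feedback).
    budget: maximum number of distinct documents to retrieve.
    Returns (retrieved doc ids in order, Beta posterior per sub-query).
    """
    rng = random.Random(seed)
    # Beta(1, 1) prior: no initial belief about any sub-query's usefulness.
    posterior = {q: [1.0, 1.0] for q in ranked_docs}
    cursor = {q: 0 for q in ranked_docs}  # next unread rank per sub-query
    retrieved, seen = [], set()

    while len(retrieved) < budget:
        # Only sub-queries with unread documents can still be pulled.
        active = [q for q in ranked_docs if cursor[q] < len(ranked_docs[q])]
        if not active:
            break
        # Sample a plausible utility for each arm and pull the best sample.
        choice = max(active, key=lambda q: rng.betavariate(*posterior[q]))
        doc = ranked_docs[choice][cursor[choice]]
        cursor[choice] += 1
        if doc in seen:
            continue  # duplicate across sub-queries: skip, spend no budget
        seen.add(doc)
        retrieved.append(doc)
        # Update the belief about the chosen sub-query with a 0/1 reward.
        reward = 1 if is_relevant(doc) else 0
        posterior[choice][0] += reward
        posterior[choice][1] += 1 - reward

    return retrieved, posterior


if __name__ == "__main__":
    # Toy data: one productive sub-query, one unproductive one.
    toy = {
        "useful sub-query": [f"good-{i}" for i in range(10)],
        "noisy sub-query": [f"bad-{i}" for i in range(10)],
    }
    docs, beliefs = thompson_subquery_retrieval(
        toy, lambda d: d.startswith("good"), budget=8
    )
    print(docs)
    print(beliefs)
```

In this sketch, a sub-query whose early documents prove relevant accumulates a higher posterior mean and is pulled more often (exploitation), while the random sampling step still occasionally tries the alternatives (exploration).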
Similar Papers
Question Decomposition for Retrieval-Augmented Generation
Computation and Language
Helps computers answer tricky questions from many sources.
Decomposed Reasoning with Reinforcement Learning for Relevance Assessment in UGC Platforms
Information Retrieval
Helps computers understand messy online posts better.
Metadata-Driven Retrieval-Augmented Generation for Financial Question Answering
Information Retrieval
Helps computers understand long financial papers better.