LURE-RAG: Lightweight Utility-driven Reranking for Efficient RAG
By: Manish Chandra, Debasis Ganguly, Iadh Ounis
Potential Business Impact:
Helps AI answer questions better by ordering information.
Most conventional Retrieval-Augmented Generation (RAG) pipelines rely on relevance-based retrieval, which often misaligns with utility -- that is, whether the retrieved passages actually improve the quality of the generated text specific to a downstream task such as question answering or query-based summarization. The limitations of existing utility-driven retrieval approaches for RAG are that, firstly, they are resource-intensive typically requiring query encoding, and that secondly, they do not involve listwise ranking loss during training. The latter limitation is particularly critical, as the relative order between documents directly affects generation in RAG. To address this gap, we propose Lightweight Utility-driven Reranking for Efficient RAG (LURE-RAG), a framework that augments any black-box retriever with an efficient LambdaMART-based reranker. Unlike prior methods, LURE-RAG trains the reranker with a listwise ranking loss guided by LLM utility, thereby directly optimizing the ordering of retrieved documents. Experiments on two standard datasets demonstrate that LURE-RAG achieves competitive performance, reaching 97-98% of the state-of-the-art dense neural baseline, while remaining efficient in both training and inference. Moreover, its dense variant, UR-RAG, significantly outperforms the best existing baseline by up to 3%.
Similar Papers
Distilling a Small Utility-Based Passage Selector to Enhance Retrieval-Augmented Generation
Information Retrieval
Makes AI answers better by finding useful facts.
L-RAG: Balancing Context and Retrieval with Entropy-Based Lazy Loading
Information Retrieval
Makes AI smarter by only looking up facts when needed.
LIR$^3$AG: A Lightweight Rerank Reasoning Strategy Framework for Retrieval-Augmented Generation
Computation and Language
Makes smart computers answer questions faster and better.