Think Straight, Stop Smart: Structured Reasoning for Efficient Multi-Hop RAG
By: Jihwan Bang , Juntae Lee , Seunghan Yang and more
Potential Business Impact:
Makes AI answer questions faster and more reliably.
Multi-hop retrieval-augmented generation (RAG) is a promising strategy for complex reasoning, yet existing iterative prompting approaches remain inefficient. They often regenerate predictable token sequences at every step and rely on stochastic stopping, leading to excessive token usage and unstable termination. We propose TSSS (Think Straight, Stop Smart), a structured multi-hop RAG framework designed for efficiency. TSSS introduces (i) a template-based reasoning that caches recurring prefixes and anchors sub-queries to the main question, reducing token generation cost while promoting stable reasoning, and (ii) a retriever-based terminator, which deterministically halts reasoning once additional sub-queries collapse into repetition. This separation of structured reasoning and termination control enables both faster inference and more reliable answers. On HotpotQA, 2WikiMultiHop, and MuSiQue, TSSS achieves state-of-the-art accuracy and competitive efficiency among RAG-CoT approaches, highlighting its effectiveness in efficiency-constrained scenarios such as on-device inference.
Similar Papers
StepChain GraphRAG: Reasoning Over Knowledge Graphs for Multi-Hop Question Answering
Computation and Language
Answers complex questions by finding connected facts.
Beyond Chunks and Graphs: Retrieval-Augmented Generation through Triplet-Driven Thinking
Information Retrieval
Makes AI answer questions more accurately and cheaply.
Transforming Questions and Documents for Semantically Aligned Retrieval-Augmented Generation
Computation and Language
Answers hard questions by breaking them down.