Score: 1

Knowledge Compression via Question Generation: Enhancing Multihop Document Retrieval without Fine-tuning

Published: June 9, 2025 | arXiv ID: 2506.13778v1

By: Anvi Alex Eponon , Moein Shahiki-Tash , Ildar Batyrshin and more

Potential Business Impact:

Helps computers find answers by asking questions.

Business Areas:

Semantic Search Internet Services

This study presents a question-based knowledge encoding approach that improves retrieval-augmented generation (RAG) systems without requiring fine-tuning or traditional chunking. We encode textual content using generated questions that span the lexical and semantic space, creating targeted retrieval cues combined with a custom syntactic reranking method. In single-hop retrieval over 109 scientific papers, our approach achieves a Recall@3 of 0.84, outperforming traditional chunking methods by 60 percent. We also introduce "paper-cards", concise paper summaries under 300 characters, which enhance BM25 retrieval, increasing MRR@3 from 0.56 to 0.85 on simplified technical queries. For multihop tasks, our reranking method reaches an F1 score of 0.52 with LLaMA2-Chat-7B on the LongBench 2WikiMultihopQA dataset, surpassing chunking and fine-tuned baselines which score 0.328 and 0.412 respectively. This method eliminates fine-tuning requirements, reduces retrieval latency, enables intuitive question-driven knowledge access, and decreases vector storage demands by 80%, positioning it as a scalable and efficient RAG alternative.

Transforming Questions and Documents for Semantically Aligned Retrieval-Augmented Generation

Computation and Language

Answers hard questions by breaking them down.

13 Aug 2025 0

93%

Enhancing Document-Level Question Answering via Multi-Hop Retrieval-Augmented Generation with LLaMA 3

Computation and Language

Answers hard questions from long texts better.

19 Jun 2025 0

93%

FrugalRAG: Learning to retrieve and reason for multi-hop QA

Computation and Language

Answers questions using fewer searches.

10 Jul 2025 2

View PDF Login to Bookmark

Repos / Data Links

github.com

Page Count

13 pages

Knowledge Compression via Question Generation: Enhancing Multihop Document Retrieval without Fine-tuning

Helps computers find answers by asking questions.

Technical Abstract

Transforming Questions and Documents for Semantically Aligned Retrieval-Augmented Generation

Enhancing Document-Level Question Answering via Multi-Hop Retrieval-Augmented Generation with LLaMA 3

FrugalRAG: Learning to retrieve and reason for multi-hop QA