Hypothetical Documents or Knowledge Leakage? Rethinking LLM-based Query Expansion
By: Yejun Yoon, Jaeyoon Jung, Seunghyun Yoon, and more
Potential Business Impact:
AI search tricks may only look smart because test answers leak into them.
Query expansion methods powered by large language models (LLMs) have demonstrated effectiveness in zero-shot retrieval tasks. These methods assume that LLMs can generate hypothetical documents that, when incorporated into a query vector, enhance the retrieval of real evidence. However, we challenge this assumption by investigating whether knowledge leakage in benchmarks contributes to the observed performance gains. Using fact verification as a testbed, we analyze whether the generated documents contain information entailed by ground-truth evidence and assess their impact on performance. Our findings indicate that, on average, performance improvements consistently occurred for claims whose generated documents included sentences entailed by gold evidence. This suggests that knowledge leakage may be present in fact-verification benchmarks, potentially inflating the perceived performance of LLM-based query expansion methods.
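The core analysis described here is straightforward to prototype: expand the claim HyDE-style by folding an LLM-generated hypothetical document into the query embedding, then test each generated sentence for entailment against the gold evidence with an NLI model to flag possible leakage. The sketch below illustrates that idea; the model names (all-MiniLM-L6-v2, roberta-large-mnli), the embedding-averaging scheme, the naive sentence splitter, and the helper names are assumptions for illustration, not the authors' actual pipeline.

```python
# Minimal sketch, assuming a HyDE-style expansion (average the claim and
# hypothetical-document embeddings) and an off-the-shelf NLI model for the
# entailment/leakage check. Models, splitter, and names are illustrative only.
import numpy as np
from sentence_transformers import SentenceTransformer
from transformers import pipeline

encoder = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")  # assumed dense retriever
nli = pipeline("text-classification", model="roberta-large-mnli")        # assumed NLI checker


def expanded_query_vector(claim: str, hypothetical_doc: str) -> np.ndarray:
    """HyDE-style expansion: fold the generated document into the query vector."""
    vecs = encoder.encode([claim, hypothetical_doc])
    return vecs.mean(axis=0)


def leaked_sentences(hypothetical_doc: str, gold_evidence: str, threshold: float = 0.9) -> list[str]:
    """Return generated sentences entailed by the gold evidence (possible leakage)."""
    leaked = []
    for sent in hypothetical_doc.split(". "):  # naive splitter; a real pipeline would use spaCy/NLTK
        if not sent.strip():
            continue
        # Premise = gold evidence, hypothesis = generated sentence.
        result = nli([{"text": gold_evidence, "text_pair": sent}])[0]
        if result["label"] == "ENTAILMENT" and result["score"] >= threshold:
            leaked.append(sent)
    return leaked


if __name__ == "__main__":
    claim = "The Eiffel Tower was completed in 1889."
    hypo_doc = "The Eiffel Tower opened in 1889 for the World's Fair. It is located in Paris."
    gold = "The Eiffel Tower was finished in 1889 as the entrance arch to the 1889 World's Fair."
    print(expanded_query_vector(claim, hypo_doc).shape)
    print(leaked_sentences(hypo_doc, gold))
```

In this framing, claims with a non-empty leaked-sentence set would be analyzed separately from those without, which is how one could test whether retrieval gains concentrate on leaked claims.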
Similar Papers
Enhancing LLM Knowledge Learning through Generalization
Computation and Language
Helps computers remember new facts without forgetting old ones.
Improving Factuality in LLMs via Inference-Time Knowledge Graph Construction
Computation and Language
Makes AI answers more truthful by building knowledge maps.
Question Answering under Temporal Conflict: Evaluating and Organizing Evolving Knowledge with LLMs
Computation and Language
Helps computers remember and use new facts.