ThinkQE: Query Expansion via an Evolving Thinking Process
By: Yibin Lei, Tao Shen, Andrew Yates
Potential Business Impact:
Finds better search results by thinking more.
Effective query expansion for web search benefits from promoting both exploration and result diversity to capture multiple interpretations and facets of a query. While recent LLM-based methods have improved retrieval performance and demonstrate strong domain generalization without additional training, they often generate narrowly focused expansions that overlook these desiderata. We propose ThinkQE, a test-time query expansion framework addressing this limitation through two key components: a thinking-based expansion process that encourages deeper and comprehensive semantic exploration, and a corpus-interaction strategy that iteratively refines expansions using retrieval feedback from the corpus. Experiments on diverse web search benchmarks (DL19, DL20, and BRIGHT) show ThinkQE consistently outperforms prior approaches, including training-intensive dense retrievers and rerankers.
Similar Papers
Query Expansion in the Age of Pre-trained and Large Language Models: A Comprehensive Survey
Information Retrieval
Helps computers find better answers to your questions.
Ontology-Guided Query Expansion for Biomedical Document Retrieval using Large Language Models
Information Retrieval
Helps find medical answers in science papers.
TCDE: Topic-Centric Dual Expansion of Queries and Documents with Large Language Models for Information Retrieval
Information Retrieval
Helps computers find information better by understanding topics.