SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
By: Xiaofeng Shi , Yuduo Li , Qian Kou and more
Potential Business Impact:
Finds research papers much better.
Recent advances in large language models (LLMs) have opened new opportunities for academic literature retrieval. However, existing systems often rely on rigid pipelines and exhibit limited reasoning capabilities. We introduce SPAR, a multi-agent framework that incorporates RefChain-based query decomposition and query evolution to enable more flexible and effective search. To facilitate systematic evaluation, we also construct SPARBench, a challenging benchmark with expert-annotated relevance labels. Experimental results demonstrate that SPAR substantially outperforms strong baselines, achieving up to +56% F1 on AutoScholar and +23% F1 on SPARBench over the best-performing baseline. Together, SPAR and SPARBench provide a scalable, interpretable, and high-performing foundation for advancing research in scholarly retrieval. Code and data will be available at: https://github.com/xiaofengShi/SPAR
Similar Papers
SPAR: Session-based Pipeline for Adaptive Retrieval on Legacy File Systems
Information Retrieval
Finds old computer files faster and smarter.
FIRESPARQL: A LLM-based Framework for SPARQL Query Generation over Scholarly Knowledge Graphs
Artificial Intelligence
Helps computers answer questions from science papers.
SPARQL-LLM: Real-Time SPARQL Query Generation from Natural Language Questions
Information Retrieval
Computers understand questions to find data.