Score: 1

An Open and Reproducible Deep Research Agent for Long-Form Question Answering

Published: December 15, 2025 | arXiv ID: 2512.13059v1

By: Ikuya Yamada , Wataru Ikeda , Ko Yoshida and more

Potential Business Impact:

Helps computers answer hard questions by searching and thinking.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

We present an open deep research system for long-form question answering, selected as a winning system in the text-to-text track of the MMU-RAG competition at NeurIPS 2025. The system combines an open-source large language model (LLM) with an open web search API to perform iterative retrieval, reasoning, and synthesis in real-world open-domain settings. To enhance reasoning quality, we apply preference tuning based on LLM-as-a-judge feedback that evaluates multiple aspects, including clarity, insightfulness, and factuality. Our experimental results show that the proposed method consistently improves answer quality across all three aspects. Our source code is publicly available at https://github.com/efficient-deep-research/efficient-deep-research.

Country of Origin
🇯🇵 Japan


Page Count
13 pages

Category
Computer Science:
Computation and Language