Score: 2

AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning

Published: December 18, 2025 | arXiv ID: 2512.16883v1

By: Tzu-Han Lin , Wei-Lin Chen , Chen-An Li and more

Potential Business Impact:

Helps AI know when to search for answers.

Business Areas:

Semantic Search Internet Services

Equipping large language models (LLMs) with search engines via reinforcement learning (RL) has emerged as an effective approach for building search agents. However, overreliance on search introduces unnecessary cost and risks exposure to noisy or malicious content, while relying solely on parametric knowledge risks hallucination. The central challenge is to develop agents that adaptively balance parametric knowledge with external search, invoking search only when necessary. Prior work mitigates search overuse by shaping rewards around the number of tool calls. However, these penalties require substantial reward engineering, provide ambiguous credit assignment, and can be exploited by agents that superficially reduce calls. Moreover, evaluating performance solely through call counts conflates necessary and unnecessary search, obscuring the measurement of true adaptive behavior. To address these limitations, we first quantify the self-knowledge awareness of existing search agents via an F1-based decision metric, revealing that methods such as Search-R1 often overlook readily available parametric knowledge. Motivated by these findings, we propose AdaSearch, a simple two-stage, outcome-driven RL framework that disentangles problem solving from the decision of whether to invoke search, and makes this decision process explicit and interpretable. This transparency is crucial for high-stakes domains such as finance and medical question answering, yet is largely neglected by prior approaches. Experiments across multiple model families and sizes demonstrate that AdaSearch substantially improves knowledge-boundary awareness, reduces unnecessary search calls, preserves strong task performance, and offers more transparent, interpretable decision behaviors.

AI-SearchPlanner: Modular Agentic Search via Pareto-Optimal Multi-Objective Reinforcement Learning

Artificial Intelligence

Helps AI find answers better by planning searches.

28 Aug 2025 2

91%

AI-SearchPlanner: Modular Agentic Search via Pareto-Optimal Multi-Objective Reinforcement Learning

Artificial Intelligence

Helps AI find answers better by planning searches.

28 Aug 2025 2

90%

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications

Artificial Intelligence

Teaches computers to find better answers online.

19 Oct 2025 1

View PDF Login to Bookmark

Country of Origin

🇺🇸 🇹🇼 Taiwan, Province of China, United States

Repos / Data Links

github.com

Page Count

32 pages

AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning

Helps AI know when to search for answers.

Technical Abstract

AI-SearchPlanner: Modular Agentic Search via Pareto-Optimal Multi-Objective Reinforcement Learning

AI-SearchPlanner: Modular Agentic Search via Pareto-Optimal Multi-Objective Reinforcement Learning

A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications