Nested Browser-Use Learning for Agentic Information Seeking
By: Baixuan Li , Jialong Wu , Wenbiao Yin and more
Potential Business Impact:
Lets computers explore websites like people do.
Information-seeking (IS) agents have achieved strong performance across a range of wide and deep search tasks, yet their tool use remains largely restricted to API-level snippet retrieval and URL-based page fetching, limiting access to the richer information available through real browsing. While full browser interaction could unlock deeper capabilities, its fine-grained control and verbose page content returns introduce substantial complexity for ReAct-style function-calling agents. To bridge this gap, we propose Nested Browser-Use Learning (NestBrowse), which introduces a minimal and complete browser-action framework that decouples interaction control from page exploration through a nested structure. This design simplifies agentic reasoning while enabling effective deep-web information acquisition. Empirical results on challenging deep IS benchmarks demonstrate that NestBrowse offers clear benefits in practice. Further in-depth analyses underscore its efficiency and flexibility.
Similar Papers
WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking
Computation and Language
Helps AI find answers online much faster.
Go-Browse: Training Web Agents with Structured Exploration
Computation and Language
Helps robots learn to use websites better.
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents
Computation and Language
Helps computers find answers by searching the web.