Score: 3

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis

Published: October 28, 2025 | arXiv ID: 2510.24695v1

By: Xuanzhong Chen, Zile Qiao, Guoxin Chen and more

BigTech Affiliations: Alibaba

Potential Business Impact:

Teaches AI to solve harder problems with help.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Training large language model agents on tasks at the frontier of their capabilities is key to unlocking advanced reasoning. We introduce a data synthesis approach inspired by the educational theory of the Zone of Proximal Development (ZPD), which defines this frontier as tasks an LLM cannot solve alone but can master with guidance. To operationalize this, we present the AgentFrontier Engine, an automated pipeline that synthesizes high-quality, multidisciplinary data situated precisely within the LLM's ZPD. This engine supports both continued pre-training with knowledge-intensive data and targeted post-training on complex reasoning tasks. From the same framework, we derive the ZPD Exam, a dynamic and automated benchmark designed to evaluate agent capabilities on these frontier tasks. We train the AgentFrontier-30B-A3B model on our synthesized data; it achieves state-of-the-art results on demanding benchmarks such as Humanity's Last Exam, even surpassing some leading proprietary agents. Our work demonstrates that a ZPD-guided approach to data synthesis offers a scalable and effective path toward building more capable LLM agents.
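The ZPD definition in the abstract (tasks a model cannot solve alone but can solve with guidance) can be illustrated with a small filtering sketch. This is not the AgentFrontier Engine itself, whose internals the abstract does not describe. The names `Task`, `classify_task`, and `select_zpd_tasks`, the two solver callables, and the use of exact string match as the correctness check are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List

# Hypothetical solver type: takes a question and returns an answer string.
Solver = Callable[[str], str]


@dataclass
class Task:
    question: str
    answer: str  # reference answer, compared by exact match (an assumption)


def classify_task(task: Task, unaided: Solver, guided: Solver) -> str:
    """Label a task relative to the unaided solver.

    'mastered' -> solved without help
    'zpd'      -> fails alone but succeeds with guidance
    'beyond'   -> fails even with guidance
    """
    if unaided(task.question).strip() == task.answer:
        return "mastered"
    if guided(task.question).strip() == task.answer:
        return "zpd"
    return "beyond"


def select_zpd_tasks(tasks: Iterable[Task], unaided: Solver, guided: Solver) -> List[Task]:
    """Keep only the tasks that fall inside the ZPD band."""
    return [t for t in tasks if classify_task(t, unaided, guided) == "zpd"]


# Toy stand-ins for a model without and with guidance (lookup tables, not real LLMs).
unaided_answers = {"2+2": "4"}
guided_answers = {"2+2": "4", "capital of France": "Paris"}


def unaided(question: str) -> str:
    return unaided_answers.get(question, "")


def guided(question: str) -> str:
    return guided_answers.get(question, "")


tasks = [
    Task("2+2", "4"),
    Task("capital of France", "Paris"),
    Task("open problem", "unknown"),
]
zpd_tasks = select_zpd_tasks(tasks, unaided, guided)
print([t.question for t in zpd_tasks])  # prints ['capital of France']
```

In practice a single attempt per solver is noisy; a more realistic variant would likely compare pass rates over several samples, but that refinement is likewise an assumption rather than something stated in the abstract.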

Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms

Computation and Language

Teaches computers to find answers by searching the web.

15 Oct 2025 1

88%

ZPD Detector: Data Selection via Capability-Difficulty Alignment for Large Language Models

Computation and Language

Teaches computers faster with smarter data choices.

16 Jan 2026 0

87%

A Fuzzy Logic Prompting Framework for Large Language Models in Adaptive and Uncertain Tasks

Artificial Intelligence

Teaches computers to help you learn better.

8 Aug 2025 0


Country of Origin

🇨🇳 China

Repos / Data Links

github.com

Page Count

30 pages

