AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis

Xuanzhong Chen; Zile Qiao; Guoxin Chen; Liangcai Su; Zhen Zhang; Xinyu Wang; Pengjun Xie; Fei Huang; Jingren Zhou; Yong Jiang

arXiv:2510.24695·cs.CL·October 29, 2025

AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis

Xuanzhong Chen, Zile Qiao, Guoxin Chen, Liangcai Su, Zhen Zhang, Xinyu Wang, Pengjun Xie, Fei Huang, Jingren Zhou, Yong Jiang

PDF

TL;DR

This paper introduces a ZPD-guided data synthesis method for training large language model agents, enabling them to master complex tasks and achieve state-of-the-art performance on challenging benchmarks.

Contribution

It presents the AgentFrontier Engine for automated, ZPD-aligned data synthesis and introduces the ZPD Exam benchmark for evaluating frontier capabilities.

Findings

01

Achieved state-of-the-art results on Humanity's Last Exam.

02

Synthesized high-quality, multidisciplinary data within the LLM's ZPD.

03

Surpassed some proprietary agents on complex reasoning benchmarks.

Abstract

Training large language model agents on tasks at the frontier of their capabilities is key to unlocking advanced reasoning. We introduce a data synthesis approach inspired by the educational theory of the Zone of Proximal Development (ZPD), which defines this frontier as tasks an LLM cannot solve alone but can master with guidance. To operationalize this, we present the AgentFrontier Engine, an automated pipeline that synthesizes high-quality, multidisciplinary data situated precisely within the LLM's ZPD. This engine supports both continued pre-training with knowledge-intensive data and targeted post-training on complex reasoning tasks. From the same framework, we derive the ZPD Exam, a dynamic and automated benchmark designed to evaluate agent capabilities on these frontier tasks. We train AgentFrontier-30B-A3B model on our synthesized data, which achieves state-of-the-art results on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.