Beyond Quantity: Trajectory Diversity Scaling for Code Agents

Guhong Chen; Chenghao Sun; Cheng Fu; Qiyao Wang; Zhihong Huang; Chaopeng Wei; Guangxu Chen; Feiteng Fang; Ahmadreza Argha; Bing Zhao; Xander Xu; Qi Han; Hamid Alinejad-Rokny; Qiang Qu; Binhua Li; Shiwen Ni; Min Yang; Hu Wei; Yongbin Li

arXiv:2602.03219·cs.AI·February 10, 2026

Beyond Quantity: Trajectory Diversity Scaling for Code Agents

Guhong Chen, Chenghao Sun, Cheng Fu, Qiyao Wang, Zhihong Huang, Chaopeng Wei, Guangxu Chen, Feiteng Fang, Ahmadreza Argha, Bing Zhao, Xander Xu, Qi Han, Hamid Alinejad-Rokny, Qiang Qu, Binhua Li, Shiwen Ni, Min Yang, Hu Wei, Yongbin Li

PDF

Open Access

TL;DR

TDScaling introduces a trajectory diversity-based data synthesis framework that enhances code agent performance by focusing on diversity rather than quantity, addressing limitations of synthetic data quality and diminishing returns.

Contribution

The paper presents TDScaling, a novel framework that improves code agent training through trajectory diversity scaling, integrating four key innovations for better generalization and proficiency.

Findings

01

Improves tool-use generalization on benchmarks

02

Enhances inherent coding capabilities

03

Outperforms quantity-based scaling methods

Abstract

As code large language models (LLMs) evolve into tool-interactive agents via the Model Context Protocol (MCP), their generalization is increasingly limited by low-quality synthetic data and the diminishing returns of quantity scaling. Moreover, quantity-centric scaling exhibits an early bottleneck that underutilizes trajectory data. We propose TDScaling, a Trajectory Diversity Scaling-based data synthesis framework for code agents that scales performance through diversity rather than raw volume. Under a fixed training budget, increasing trajectory diversity yields larger gains than adding more trajectories, improving the performance-cost trade-off for agent training. TDScaling integrates four innovations: (1) a Business Cluster mechanism that captures real-service logical dependencies; (2) a blueprint-driven multi-agent paradigm that enforces trajectory coherence; (3) an adaptive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Software Engineering Research · Machine Learning and Algorithms