TodoEvolve: Learning to Architect Agent Planning Systems

Jiaxi Liu; Yanzuo Jiang; Guibin Zhang; Zihan Zhang; Heng Chang; Zhenfei Yin; Qibing Ren; Junchi Yan

arXiv:2602.07839·cs.CL·February 10, 2026

TL;DR

TodoEvolve is a meta-planning framework that autonomously synthesizes task-specific planning architectures for agents. It outperforms fixed planning structures on five benchmarks while keeping API costs and runtime overhead low.

Contribution

The paper introduces PlanFactory, a modular design space that standardizes diverse planning paradigms behind a common interface, and trains Todo-14B with Impedance-Guided Preference Optimization (IGPO).

Findings

01 Outperforms fixed planning modules on five benchmarks

02 Maintains low API costs and runtime overhead

03 Generates adaptable planning architectures

Abstract

Planning has become a central capability for contemporary agent systems in navigating complex, long-horizon tasks, yet existing approaches predominantly rely on fixed, hand-crafted planning structures that lack the flexibility to adapt to the structural diversity of open-ended problems. To address this limitation, we introduce TodoEvolve, a meta-planning paradigm that autonomously synthesizes and dynamically revises task-specific planning architectures. Specifically, we first construct PlanFactory, a modular design space that standardizes diverse planning paradigms within a unified codebase encompassing topology, initialization, adaptation, and navigation, thereby providing a common interface for heterogeneous planning patterns. Leveraging PlanFactory, we collect high-quality planning trajectories and train Todo-14B via Impedance-Guided Preference Optimization (IGPO), a…
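To make the four PlanFactory dimensions concrete, here is a minimal Python sketch of what a common interface over topology, initialization, adaptation, and navigation might look like. It is an illustration under stated assumptions, not the paper's codebase: the names `PlanNode`, `PlanningSystem`, and `LinearTodoPlanner`, along with the toy decomposition and revision rules, are all hypothetical.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class PlanNode:
    """One unit of work in a plan (hypothetical structure)."""
    description: str
    done: bool = False


class PlanningSystem(ABC):
    """Hypothetical interface over the four dimensions named in the abstract."""

    # Topology: the shape of the plan, e.g. "linear", "tree", or "graph".
    topology: str = "unspecified"

    @abstractmethod
    def initialize(self, task: str) -> List[PlanNode]:
        """Initialization: build the first plan from a task description."""

    @abstractmethod
    def adapt(self, plan: List[PlanNode], feedback: str) -> List[PlanNode]:
        """Adaptation: revise the plan in light of execution feedback."""

    @abstractmethod
    def navigate(self, plan: List[PlanNode]) -> Optional[PlanNode]:
        """Navigation: choose the next node to execute, or None if finished."""


class LinearTodoPlanner(PlanningSystem):
    """Toy linear planner; the rules below are assumptions for illustration."""

    topology = "linear"

    def initialize(self, task: str) -> List[PlanNode]:
        # Toy decomposition: treat ';' as a step separator.
        return [PlanNode(s.strip()) for s in task.split(";") if s.strip()]

    def adapt(self, plan: List[PlanNode], feedback: str) -> List[PlanNode]:
        # Toy revision: append a recovery step when feedback mentions a failure.
        if "fail" in feedback.lower():
            return plan + [PlanNode("retry after: " + feedback)]
        return plan

    def navigate(self, plan: List[PlanNode]) -> Optional[PlanNode]:
        # Linear topology: the next step is the first unfinished one.
        return next((node for node in plan if not node.done), None)
```

Under this sketch, a tree- or graph-shaped planner would implement the same three methods with a different `topology`, which is one plausible reading of "a common interface for heterogeneous planning patterns".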

Code & Models

Models

Hugging Face: EcthelionLiu/Todo-14B (model, 27 downloads, 1 like)

Taxonomy

Topics: AI-based Problem Solving and Planning · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms