TopoCurate: Modeling Interaction Topology for Tool-Use Agent Training

Jinluan Yang; Yuxin Liu; Zhengyu Chen; Chengcheng Han; Yueqing Sun; Qi Gu; Hui Su; Xunliang Cai; Fei Wu; Kun Kuang

arXiv:2603.01714·cs.LG·March 3, 2026

TL;DR

TopoCurate introduces an interaction-aware framework that structures multi-trial tool-use trajectories into a semantic topology, improving training by emphasizing recovery, diversity, and strategic complexity.

Contribution

It proposes a novel topology-based projection of interaction trajectories and a dual-selection mechanism for SFT and RL, enhancing agent training effectiveness.

Findings

01

Achieves a 4.2% improvement in the SFT setting.

02

Achieves a 6.9% improvement in the RL setting.

03

Demonstrates robustness across the BFCLv3 and Tau2 Bench benchmarks.

Abstract

Training tool-use agents typically relies on outcome-based filtering: Supervised Fine-Tuning (SFT) on successful trajectories and Reinforcement Learning (RL) on pass-rate-selected tasks. However, this paradigm ignores interaction dynamics: successful trajectories may lack error recovery or exhibit redundancy, while pass rates fail to distinguish structurally informative tasks from trivial ones. We propose TopoCurate, an interaction-aware framework that projects multi-trial rollouts from the same task into a unified semantic quotient topology. By merging equivalent action-observation states, this projection transforms scattered linear trajectories into a structured manifold that explicitly captures how tool invocations and environmental responses drive the divergence between effective strategies and failure modes. Leveraging this representation, we introduce a dual-selection…
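
The idea of merging equivalent action-observation states across rollouts can be illustrated with a small sketch. This is not the authors' implementation: the function names (`state_key`, `build_quotient_graph`, `branching_nodes`), the rollout format, and the tool names in the sample data are all hypothetical. In particular, equivalence is approximated here by exact match on normalized text, whereas the abstract describes a semantic notion of equivalence, and the sketch ignores interaction history when merging steps.

```python
from collections import defaultdict


def state_key(action, observation):
    # ASSUMPTION: exact match on normalized text stands in for the
    # semantic equivalence described in the abstract.
    return (action.strip().lower(), observation.strip().lower())


def build_quotient_graph(rollouts):
    """Merge equivalent (action, observation) steps from rollouts of one task.

    rollouts: list of (steps, success), where steps is a list of
    (action, observation) string pairs and success is a bool.
    Returns (edges, outcomes):
      edges    maps a node to the set of nodes that follow it,
      outcomes maps a node to [successful_rollouts, failed_rollouts]
               that pass through it (each rollout counted once per node).
    """
    edges = defaultdict(set)
    outcomes = defaultdict(lambda: [0, 0])
    for steps, success in rollouts:
        prev = "START"
        visited = set()
        for action, observation in steps:
            node = state_key(action, observation)
            edges[prev].add(node)
            if node not in visited:
                outcomes[node][0 if success else 1] += 1
                visited.add(node)
            prev = node
    return dict(edges), dict(outcomes)


def branching_nodes(edges):
    # Nodes with more than one successor are where rollouts diverge.
    return [node for node, successors in edges.items() if len(successors) > 1]


# Hypothetical rollouts for a single task (tool names are made up).
rollouts = [
    ([("search_flights", "3 results"), ("book_flight", "ok")], True),
    ([("search_flights", "3 results"),
      ("book_flight", "error: missing id"),
      ("get_user", "id=7"),
      ("book_flight", "ok")], True),
    ([("search_flights", "3 results"), ("cancel", "error")], False),
]

edges, outcomes = build_quotient_graph(rollouts)
divergence_points = branching_nodes(edges)
```

In this toy data the three linear rollouts share their first step, so they collapse into one graph that branches after the search call. The second rollout passes through an error node and still succeeds, which is the kind of recovery path that outcome-only filtering would not distinguish from the first rollout.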

Taxonomy

Topics: Reinforcement Learning in Robotics · Mobile Crowdsensing and Crowdsourcing · Domain Adaptation and Few-Shot Learning