TL;DR
This paper introduces Trace-Free+, a curriculum learning framework that enhances LLM-agent tool use by improving tool description quality, enabling better scalability, robustness, and generalization without extensive retraining.
Contribution
It proposes a novel trace-free curriculum learning approach and constructs a large dataset to improve tool description effectiveness for LLM agents.
Findings
Reduces accuracy degradation by 29.23% in large tool catalogs
Improves average query success rate by 60.89% on StableToolBench
Enhances robustness and generalization across domains without retraining
Abstract
While most efforts to improve LLM-based tool-using agents focus on the agent itself - through larger models, better prompting, or fine-tuning - agent performance increasingly plateaus due to the quality of the tool interfaces these agents consume. Tool descriptions are often written for human developers and tolerate ambiguity that agents cannot resolve, particularly as the number of candidate tools grows. Existing approaches to improving tool interfaces (1) require re-running a multi-stage per-tool pipeline - synthesizing queries, executing an agent to collect trajectories, annotating trajectories, and prompting a strong LLM multiple times - for every API that enters the catalog, and (2) typically optimize each tool independently, limiting scalability and generalization to unseen tools. We propose Trace-Free+, a curriculum learning framework that progressively transfers supervision from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
