Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing
Yuwen Li, Wei Zhang, Zelong Huang, Mason Yang, Jiajun Wu, Shawn Guo, Huahao Hu, Lingyi Sun, Jian Yang, Mingjie Tang, Byran Dai

TL;DR
InfTool is an autonomous multi-agent framework that synthesizes diverse tool-use data for large language models, significantly improving their ability to invoke external tools without human annotation.
Contribution
The paper introduces InfTool, a self-evolving multi-agent system that generates high-quality synthetic data for training LLMs to use external tools, overcoming annotation and generalization challenges.
Findings
Achieved 70.9% accuracy on BFCL, a 258% improvement over the base model.
Surpassed larger models and rivaled Claude-Opus using only synthetic data.
Demonstrated effective closed-loop training without human intervention.
Abstract
Enabling Large Language Models (LLMs) to reliably invoke external tools remains a critical bottleneck for autonomous agents. Existing approaches suffer from three fundamental challenges: expensive human annotation for high-quality trajectories, poor generalization to unseen tools, and quality ceilings inherent in single-model synthesis that perpetuate biases and coverage gaps. We introduce InfTool, a fully autonomous framework that breaks these barriers through self-evolving multi-agent synthesis. Given only raw API specifications, InfTool orchestrates three collaborative agents (User Simulator, Tool-Calling Assistant, and MCP Server) to generate diverse, verified trajectories spanning single-turn calls to complex multi-step workflows. The framework establishes a closed loop: synthesized data trains the model via Group Relative Policy Optimization (GRPO) with gated rewards, the improved…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗IQuestLab/IQuest-Coder-V1-7B-Instructmodel· 2.2k dl· ♡ 172.2k dl♡ 17
- 🤗IQuestLab/IQuest-Coder-V1-7B-Thinkingmodel· 423 dl· ♡ 9423 dl♡ 9
- 🤗IQuestLab/IQuest-Coder-V1-40B-Instructmodel· 12k dl· ♡ 28912k dl♡ 289
- 🤗IQuestLab/IQuest-Coder-V1-40B-Loop-Instructmodel· 12k dl· ♡ 32412k dl♡ 324
- 🤗IQuestLab/IQuest-Coder-V1-40B-Thinkingmodel· 330 dl· ♡ 16330 dl♡ 16
- 🤗IQuestLab/IQuest-Coder-V1-40B-Loop-Thinkingmodel· 162 dl· ♡ 12162 dl♡ 12
- 🤗IQuestLab/IQuest-Coder-V1-7B-Basemodel· 113 dl· ♡ 10113 dl♡ 10
- 🤗IQuestLab/IQuest-Coder-V1-40B-Base-Stage1model· 23 dl· ♡ 2823 dl♡ 28
- 🤗IQuestLab/IQuest-Coder-V1-40B-Basemodel· 114 dl· ♡ 46114 dl♡ 46
- 🤗cyankiwi/IQuest-Coder-V1-40B-Instruct-AWQ-4bitmodel· 26 dl· ♡ 326 dl♡ 3
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Artificial Intelligence in Healthcare and Education
