ToolWeave: Structured Synthesis of Complex Multi-Turn Tool-Calling Dialogues
Dinesh Khandelwal, Gnana Prakash Punnavajhala, GPS Bhargav, Gaurav Pandey, Sachin Joshi, Hima Karanam, Dinesh Raghu

TL;DR
ToolWeave is a structured framework that synthesizes realistic multi-turn tool-calling dialogues, improving the quality and diversity of training data for LLMs as autonomous agents.
Contribution
It introduces a novel data synthesis method that constructs aligned, multi-step workflows with reduced hallucinations, enhancing LLM performance on tool-calling tasks.
Findings
Synthetic dialogues have 45% more multi-step interactions.
Fewer hallucinations in parameters and tool names.
Improved LLM performance on public benchmarks.
Abstract
Multi-turn tool calling is essential for LLMs to function as autonomous agents, yet synthesizing the training data required for these capabilities remains a fundamental challenge. Existing synthetic data generation pipelines often produce unrealistic dialogues for two reasons: they chain tools that are only superficially compatible rather than aligned with meaningful user tasks, and they generate dialogues in one shot, which often introduces arguments that were neither provided by the user nor produced by prior tool calls. These issues also lead to a severe underrepresentation of multi-step tool interactions. We introduce ToolWeave, a structured framework for synthesizing realistic multi-turn tool-calling dialogues. ToolWeave support realistic multi-step workflows (or tool sequences) by constructing tools with built-in dependencies and filters the workflows based on alignment with user…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
