Simulating Complex Multi-Turn Tool Calling Interactions in Stateless Execution Environments
Maxwell Crouse, Ibrahim Abdelaziz, Kshitij Fadnis, Siva Sankalp Patel, Kinjal Basu, Chulaka Gunasekara, Sadhana Kumaravel, Asim Munawar, Pavan Kapanipathi

TL;DR
This paper introduces DiGiT-TC, a data generation method for simulating multi-turn tool calling interactions in stateless environments, addressing a gap in synthetic data creation for real-world, security-sensitive settings.
Contribution
The paper presents a novel generation pattern that enables the creation of realistic tool calling conversations without relying on stateful execution environments.
Findings
DiGiT-TC achieves strong performance gains on standard benchmarks.
It effectively simulates stateful interactions in stateless settings.
The approach is validated across multiple tool calling scenarios.
Abstract
Synthetic data has proven itself to be a valuable resource for tuning smaller, cost-effective language models to handle the complexities of multi-turn tool calling conversations. While many frameworks and systems for producing synthetic multi-turn tool calling data have been proposed, prior works have frequently assumed that any tool calling interactions will take place in an execution environment that maintains state. When such an environment is available, this is advantageous as it allows for the validity of an interaction to be determined by whether or not the state of the execution environment matches to some prespecified objective. Unfortunately, this does not hold in many real-world tool use settings, e.g., in enterprise settings where data security is of the utmost importance or in cases where tool specifications are synthesized from multiple sources. In this work, we address…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
