Linguistic and Argument Diversity in Synthetic Data for Function-Calling Agents

Dan Greenstein; Zohar Karnin; Chen Amiraz; Oren Somekh

arXiv:2601.17829·cs.CL·January 27, 2026

Linguistic and Argument Diversity in Synthetic Data for Function-Calling Agents

Dan Greenstein, Zohar Karnin, Chen Amiraz, Oren Somekh

PDF

Open Access

TL;DR

This paper introduces a method for generating diverse synthetic datasets for function-calling agents, enhancing linguistic and argument diversity without manual rules, leading to improved model performance especially out-of-distribution.

Contribution

It proposes an optimization-based approach for synthetic data generation that improves diversity and model accuracy without relying on handcrafted rules.

Findings

01

Outperforms state-of-the-art data generation methods in diversity metrics

02

Achieves 7.4% higher accuracy on BFCL benchmark

03

Maintains comparable correctness while increasing diversity

Abstract

The construction of function calling agents has emerged as a promising avenue for extending model capabilities. A major challenge for this task is obtaining high quality diverse data for training. Prior work emphasizes diversity in functions, invocation patterns, and interaction turns, yet linguistic diversity of requests and coverage of arguments (e.g., \texttt{city\_name}, \texttt{stock\_ticker}) remain underexplored. We propose a method that generates synthetic datasets via optimizing general-purpose diversity metrics across both queries and arguments, without relying on hand-crafted rules or taxonomies, making it robust to different usecases. We demonstrate the effectiveness of our technique via both intrinsic and extrinsic testing, comparing it to SoTA data generation methods. We show a superiority over baselines in terms of diversity, while keeping comparable correctness.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Mobile Crowdsensing and Crowdsourcing