GenTool: Enhancing Tool Generalization in Language Models through Zero-to-One and Weak-to-Strong Simulation
Jie He, Jennifer Neville, Mengting Wan, Longqi Yang, Hui Liu, Xiaofeng, Xu, Xia Song, Jeff Z. Pan, Pei Zhou

TL;DR
GenTool is a training framework that significantly improves large language models' ability to generalize in tool usage, enabling them to handle unseen queries and leverage better tools, thus enhancing their AI assistant capabilities.
Contribution
The paper introduces GenTool, a novel two-stage fine-tuning approach with synthetic data to enhance LLMs' zero-to-one and weak-to-strong tool generalization capabilities.
Findings
Substantially improves tool-usage performance across models from 1B to 8B parameters.
Outperforms GPT-4o in generalization scenarios.
Provides insights into challenges faced by LLMs in tool generalization.
Abstract
Large Language Models (LLMs) can enhance their capabilities as AI assistants by integrating external tools, allowing them to access a wider range of information. While recent LLMs are typically fine-tuned with tool usage examples during supervised fine-tuning (SFT), questions remain about their ability to develop robust tool-usage skills and can effectively generalize to unseen queries and tools. In this work, we present GenTool, a novel training framework that prepares LLMs for diverse generalization challenges in tool utilization. Our approach addresses two fundamental dimensions critical for real-world applications: Zero-to-One Generalization, enabling the model to address queries initially lacking a suitable tool by adopting and utilizing one when it becomes available, and Weak-to-Strong Generalization, allowing models to leverage enhanced versions of existing tools to solve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Machine Learning in Healthcare
