iTool: Reinforced Fine-Tuning with Dynamic Deficiency Calibration for Advanced Tool Use

Yirong Zeng; Xiao Ding; Yuxian Wang; Weiwen Liu; Wu Ning; Yutai Hou; Xu Huang; Duyu Tang; Dandan Tu; Bing Qin; Ting Liu

arXiv:2501.09766·cs.CL·November 10, 2025

iTool: Reinforced Fine-Tuning with Dynamic Deficiency Calibration for Advanced Tool Use

Yirong Zeng, Xiao Ding, Yuxian Wang, Weiwen Liu, Wu Ning, Yutai Hou, Xu Huang, Duyu Tang, Dandan Tu, Bing Qin, Ting Liu

PDF

Open Access 1 Video

TL;DR

This paper introduces iTool, a reinforcement-based fine-tuning method that dynamically calibrates model deficiencies to improve complex tool use in large language models, outperforming existing approaches.

Contribution

The paper proposes a novel iterative reinforced fine-tuning strategy that enhances synthetic data diversity and targets model deficiencies for better tool use capabilities.

Findings

01

Achieves 13.11% performance improvement over the base model.

02

Improves complex scenario performance by 6.5%.

03

Outperforms larger open-source and closed-source models.

Abstract

Augmenting large language models (LLMs) with external tools is a promising approach to enhance their capabilities, especially for complex tasks. Synthesizing tool-use data through real-world simulations is an effective way to achieve this. However, our investigation reveals that training gains significantly decay as synthetic data increases. The model struggles to benefit from additional synthetic data, which fails to endow it with advanced tool-use capabilities in complex scenarios Moreover, we discovered that the above limitation usually manifests as a fragment deficiency (i.e., parameter errors) in response. To this end, we propose an iterative reinforced fine-tuning strategy designed to alleviate this limitation. This strategy involves: (1) enhancing the diversity of response for synthetic data through path exploration of Monte Carlo Tree Search. (2) iteratively pinpointing the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

iTool: Reinforced Fine-Tuning with Dynamic Deficiency Calibration for Advanced Tool Use· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsALIGN · Shrink and Fine-Tune