LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls

Kangning Zhang; Wenxiang Jiao; Kounianhua Du; Yuan Lu; Weiwen Liu; Weinan Zhang; Yong Yu

arXiv:2511.09148·cs.CL·November 19, 2025

LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls

Kangning Zhang, Wenxiang Jiao, Kounianhua Du, Yuan Lu, Weiwen Liu, Weinan Zhang, Yong Yu

PDF

Open Access 2 Models 1 Datasets

TL;DR

LoopTool introduces a fully automated, closed-loop framework that iteratively refines data and models for robust tool-using LLMs, significantly improving performance without relying on costly external APIs.

Contribution

We propose LoopTool, a novel framework that integrates data synthesis and model training in a closed loop, enhancing LLM tool use capabilities through self-refinement.

Findings

01

Model trained with LoopTool outperforms larger models on benchmarks.

02

Closed-loop data refinement improves model accuracy and robustness.

03

Eliminates dependence on expensive external APIs.

Abstract

Augmenting Large Language Models (LLMs) with external tools enables them to execute complex, multi-step tasks. However, tool learning is hampered by the static synthetic data pipelines where data generation and model training are executed as two separate, non-interactive processes. This approach fails to adaptively focus on a model's specific weaknesses and allows noisy labels to persist, degrading training efficiency. We introduce LoopTool, a fully automated, model-aware data evolution framework that closes this loop by tightly integrating data synthesis and model training. LoopTool iteratively refines both the data and the model through three synergistic modules: (1) Greedy Capability Probing (GCP) diagnoses the model's mastered and failed capabilities; (2) Judgement-Guided Label Verification (JGLV) uses an open-source judge model to find and correct annotation errors, progressively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

zhuiguang-ning/LoopTool-23k
dataset· 28 dl
28 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning and Data Classification · Natural Language Processing Techniques