LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls
Kangning Zhang, Wenxiang Jiao, Kounianhua Du, Yuan Lu, Weiwen Liu, Weinan Zhang, Yong Yu

TL;DR
LoopTool introduces a fully automated, closed-loop framework that iteratively refines data and models for robust tool-using LLMs, significantly improving performance without relying on costly external APIs.
Contribution
We propose LoopTool, a novel framework that integrates data synthesis and model training in a closed loop, enhancing LLM tool use capabilities through self-refinement.
Findings
Model trained with LoopTool outperforms larger models on benchmarks.
Closed-loop data refinement improves model accuracy and robustness.
Eliminates dependence on expensive external APIs.
Abstract
Augmenting Large Language Models (LLMs) with external tools enables them to execute complex, multi-step tasks. However, tool learning is hampered by the static synthetic data pipelines where data generation and model training are executed as two separate, non-interactive processes. This approach fails to adaptively focus on a model's specific weaknesses and allows noisy labels to persist, degrading training efficiency. We introduce LoopTool, a fully automated, model-aware data evolution framework that closes this loop by tightly integrating data synthesis and model training. LoopTool iteratively refines both the data and the model through three synergistic modules: (1) Greedy Capability Probing (GCP) diagnoses the model's mastered and failed capabilities; (2) Judgement-Guided Label Verification (JGLV) uses an open-source judge model to find and correct annotation errors, progressively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning and Data Classification · Natural Language Processing Techniques
