Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning

Zhiyuan Ma; Jiayu Liu; Xianzhen Luo; Zhenya Huang; Qingfu Zhu; Wanxiang Che

arXiv:2506.04625·cs.CL·June 6, 2025

Advancing Tool-Augmented Large Language Models via Meta-Verification and Reflection Learning

Zhiyuan Ma, Jiayu Liu, Xianzhen Luo, Zhenya Huang, Qingfu Zhu, Wanxiang Che

PDF

1 Repo

TL;DR

This paper introduces Tool-MVR, a tool-augmented LLM that improves tool planning and reflection through systematic validation and dynamic learning, achieving state-of-the-art results and better error correction.

Contribution

The paper presents Tool-MVR, combining Multi-Agent Meta-Verification and Exploration-based Reflection Learning to enhance tool usage and reflection in large language models.

Findings

01

Achieves 23.9% improvement over ToolLLM on StableToolBench.

02

Reduces API calls by 31.4%.

03

Attains 58.9% error correction rate on RefineToolBench.

Abstract

Empowering large language models (LLMs) with effective tool utilization capabilities is crucial for enabling AI agents to solve complex problems. However, current models face two major limitations: (1) unreliable tool planning and invocation due to low-quality instruction datasets (e.g., widespread hallucinated API calls), and (2) weak tool reflection abilities (over 90% of errors cannot be corrected) resulting from static imitation learning. To address these critical limitations, we propose Tool-MVR, a novel Tool-Augmented LLM that achieves comprehensive System 2 reasoning through two key innovations. Specifically, we first introduce Multi-Agent Meta-Verification (MAMV), a systematic pipeline that rigorously validates APIs, queries, and reasoning trajectories to construct ToolBench-V, a new high-quality instruction dataset that addresses the limitation of unreliable tool planning and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhymma/tool-mvr
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAbsolute Position Encodings · Layer Normalization · Byte Pair Encoding · Label Smoothing · Softmax · Dropout · Dense Connections · Transformer · GPT-4