Meta-Reasoning Improves Tool Use in Large Language Models

Lisa Alazraki; Marek Rei

arXiv:2411.04535·cs.CL·September 23, 2025

Meta-Reasoning Improves Tool Use in Large Language Models

Lisa Alazraki, Marek Rei

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces TECTON, a meta-reasoning system that enhances tool selection in large language models, leading to significant improvements in math reasoning tasks both within and outside the training distribution.

Contribution

The paper proposes a novel two-phase meta-reasoning approach for tool selection, outperforming existing greedy decoding methods in large language models.

Findings

01

Substantial performance gains on math reasoning datasets

02

Effective tool selection in out-of-distribution scenarios

03

Improved reasoning accuracy with meta-reasoning approach

Abstract

External tools help large language models succeed at tasks where they would otherwise typically fail. In existing frameworks, choosing tools at test time relies on naive greedy decoding, regardless of whether the model has been fine-tuned on tool-annotated data or prompted with in-context examples. In contrast, we find that gathering and choosing among a suitable set of candidate tools has greater potential to lead to an optimal selection. We present Tool selECTion via meta-reasONing (TECTON), a two-phase system that first reasons over a task and outputs candidate tools using a custom fine-tuned language modelling head. Then, with the custom head disabled, it meta-reasons (i.e., it reasons over the previous reasoning process) to make a final choice. We show that TECTON results in substantial gains--both in-distribution and out-of-distribution--on a range of math reasoning datasets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lisaalaz/tecton
pytorchOfficial

Videos

Meta-Reasoning Improves Tool Use in Large Language Models· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies