TALM: Tool Augmented Language Models

Aaron Parisi; Yao Zhao; Noah Fiedel

arXiv:2205.12255·cs.CL·May 25, 2022·32 cites

TALM: Tool Augmented Language Models

Aaron Parisi, Yao Zhao, Noah Fiedel

PDF

Open Access

TL;DR

TALM enhances language models by integrating non-differentiable tools and iterative self-play, significantly improving performance on knowledge-heavy and reasoning tasks, especially out-of-distribution, with less reliance on scale.

Contribution

This work introduces Tool Augmented Language Models (TALM), combining non-differentiable tools with an iterative self-play method to boost performance beyond traditional scaling.

Findings

01

TALM outperforms non-augmented LMs on QA and math tasks.

02

TALM performs well on out-of-distribution inferences.

03

Less dependence on model scale improves capabilities.

Abstract

Transformer based language models (LMs) demonstrate increasing performance with scale across a wide variety of tasks. Scale alone however cannot enable models to solve tasks that require access to ephemeral, changing, or private data that was unavailable at training time. Many useful tasks may also benefit from LMs being able to access APIs that read or modify state. In this work, we present Tool Augmented Language Models (TALM), combining a text-only approach to augment language models with non-differentiable tools, and an iterative "self-play" technique to bootstrap performance starting from few tool demonstrations. TALM exhibits strong performance on both a knowledge-heavy QA task and a reasoning oriented math task with simple tools. At a given model scale, TALM significantly outperforms non-augmented LMs. We further demonstrate that TALM successfully performs out-of-distribution…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification