PAL: Program-aided Language Models

Luyu Gao; Aman Madaan; Shuyan Zhou; Uri Alon; Pengfei Liu; Yiming Yang; Jamie Callan; Graham Neubig

arXiv:2211.10435 · cs.CL · January 30, 2023 · 104 citations

TL;DR

PAL leverages language models to generate reasoning programs that are executed by a Python interpreter, significantly improving accuracy on complex reasoning tasks compared to traditional prompting methods.

Contribution

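The paper introduces Program-Aided Language models (PAL), which pair an LLM with a program interpreter: the LLM writes the reasoning steps as a program, and the interpreter runs that program to compute the answer. This improves reasoning accuracy over existing prompting techniques.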
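The division of labor described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: `fake_llm`, `run_program`, `pal_answer`, and the `solution()` convention are hypothetical names chosen for this sketch, and the model call is replaced by a canned program so the example runs offline.

```python
from typing import Callable


def fake_llm(question: str) -> str:
    """Hypothetical stand-in for a real LLM call; returns a canned program."""
    return (
        "def solution():\n"
        "    # Olivia starts with $23 and buys five bagels at $3 each.\n"
        "    money_initial = 23\n"
        "    bagels = 5\n"
        "    bagel_cost = 3\n"
        "    money_spent = bagels * bagel_cost\n"
        "    return money_initial - money_spent\n"
    )


def run_program(program: str):
    """Execute generated code and return the value of its solution() function."""
    namespace: dict = {}
    exec(program, namespace)  # the interpreter, not the model, does the arithmetic
    return namespace["solution"]()


def pal_answer(question: str, llm: Callable[[str], str] = fake_llm):
    """Ask the (stubbed) model for a program, then run it to get the answer."""
    return run_program(llm(question))


question = (
    "Olivia has $23. She bought five bagels for $3 each. "
    "How much money does she have left?"
)
print(pal_answer(question))  # prints 8
```

Running model-generated code with `exec` is only acceptable in a sketch; anything beyond a demo would need a sandboxed interpreter.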

Findings

01 PAL achieves state-of-the-art few-shot accuracy on the GSM8K benchmark.

02 PAL with a code-generation model outperforms larger models that use chain-of-thought prompting.

03 PAL improves reasoning accuracy across multiple benchmarks.

Abstract

Large language models (LLMs) have recently demonstrated an impressive ability to perform arithmetic and symbolic reasoning tasks, when provided with a few examples at test time ("few-shot prompting"). Much of this success can be attributed to prompting methods such as "chain-of-thought", which employ LLMs for both understanding the problem description by decomposing it into steps, as well as solving each step of the problem. While LLMs seem to be adept at this sort of step-by-step decomposition, LLMs often make logical and arithmetic mistakes in the solution part, even when the problem is decomposed correctly. In this paper, we present Program-Aided Language models (PAL): a novel approach that uses the LLM to read natural language problems and generate programs as the intermediate reasoning steps, but offloads the solution step to a runtime such as a Python interpreter. With PAL,…
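To make the few-shot setup concrete, the sketch below assembles a prompt whose exemplar expresses its reasoning steps as Python statements rather than prose. The prompt layout, the exemplar text, and the names `EXEMPLARS` and `build_prompt` are assumptions made for illustration; they are not the exact prompts used in the paper.

```python
# Each exemplar pairs a question with reasoning written as Python statements.
EXEMPLARS = [
    (
        "There are 15 trees in the grove. Workers plant trees until there "
        "are 21. How many trees were planted?",
        "trees_initial = 15\n"
        "trees_after = 21\n"
        "answer = trees_after - trees_initial",
    ),
]


def build_prompt(question: str, exemplars=EXEMPLARS) -> str:
    """Concatenate solved exemplars, then the new question with an open slot."""
    parts = []
    for exemplar_question, program in exemplars:
        parts.append(f"Q: {exemplar_question}\n\n# solution in Python:\n{program}\n")
    # The model is expected to continue the text with a program of the same shape.
    parts.append(f"Q: {question}\n\n# solution in Python:\n")
    return "\n".join(parts)


prompt = build_prompt("A baker has 48 rolls and sells 29. How many are left?")
print(prompt)

# The exemplar's steps are plain assignments, so an interpreter can evaluate
# them directly instead of relying on the model to do the subtraction.
scope: dict = {}
exec(EXEMPLARS[0][1], scope)
print(scope["answer"])  # prints 6
```

Because each intermediate step is a named variable, the decomposition stays readable like a chain of thought while the arithmetic itself is left to the runtime.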

Code & Models

Datasets

reasoning-machines/gsm-hard (dataset, 3.0k downloads)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques

Methods: Test