Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Jason Wei; Xuezhi Wang; Dale Schuurmans; Maarten Bosma; Brian Ichter,; Fei Xia; Ed Chi; Quoc Le; Denny Zhou

arXiv:2201.11903·cs.CL·January 12, 2023·4.2k cites

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter,, Fei Xia, Ed Chi, Quoc Le, Denny Zhou

PDF

Open Access 5 Repos 4 Models 5 Datasets 1 Video

TL;DR

This paper demonstrates that chain-of-thought prompting significantly enhances the reasoning capabilities of large language models across various tasks, enabling state-of-the-art performance without additional training.

Contribution

It introduces chain-of-thought prompting as a simple method to elicit reasoning in large language models, showing how reasoning abilities emerge with scale.

Findings

01

Chain-of-thought prompting improves performance on reasoning tasks.

02

State-of-the-art accuracy achieved on GSM8K benchmark.

03

Large models benefit significantly from few exemplars in prompting.

Abstract

We explore how generating a chain of thought -- a series of intermediate reasoning steps -- significantly improves the ability of large language models to perform complex reasoning. In particular, we show how such reasoning abilities emerge naturally in sufficiently large language models via a simple method called chain of thought prompting, where a few chain of thought demonstrations are provided as exemplars in prompting. Experiments on three large language models show that chain of thought prompting improves performance on a range of arithmetic, commonsense, and symbolic reasoning tasks. The empirical gains can be striking. For instance, prompting a 540B-parameter language model with just eight chain of thought exemplars achieves state of the art accuracy on the GSM8K benchmark of math word problems, surpassing even finetuned GPT-3 with a verifier.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

Chain-of-Thought Prompting Elicits Reasoning in Large Language Models· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsAttention Is All You Need · Chain-of-thought prompting · Linear Layer · Attention Dropout · Adam · Residual Connection · Layer Normalization · Cosine Annealing · Softmax · Dense Connections