Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter,, Fei Xia, Ed Chi, Quoc Le, Denny Zhou

TL;DR
This paper demonstrates that chain-of-thought prompting significantly enhances the reasoning capabilities of large language models across various tasks, enabling state-of-the-art performance without additional training.
Contribution
It introduces chain-of-thought prompting as a simple method to elicit reasoning in large language models, showing how reasoning abilities emerge with scale.
Findings
Chain-of-thought prompting improves performance on reasoning tasks.
State-of-the-art accuracy achieved on GSM8K benchmark.
Large models benefit significantly from few exemplars in prompting.
Abstract
We explore how generating a chain of thought -- a series of intermediate reasoning steps -- significantly improves the ability of large language models to perform complex reasoning. In particular, we show how such reasoning abilities emerge naturally in sufficiently large language models via a simple method called chain of thought prompting, where a few chain of thought demonstrations are provided as exemplars in prompting. Experiments on three large language models show that chain of thought prompting improves performance on a range of arithmetic, commonsense, and symbolic reasoning tasks. The empirical gains can be striking. For instance, prompting a 540B-parameter language model with just eight chain of thought exemplars achieves state of the art accuracy on the GSM8K benchmark of math word problems, surpassing even finetuned GPT-3 with a verifier.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsAttention Is All You Need · Chain-of-thought prompting · Linear Layer · Attention Dropout · Adam · Residual Connection · Layer Normalization · Cosine Annealing · Softmax · Dense Connections
