Large Language Models are Zero-Shot Reasoners
Takeshi Kojima, Shixiang Shane Gu, Machel Reid, Yutaka Matsuo, Yusuke, Iwasawa

TL;DR
Large language models can perform complex reasoning tasks in a zero-shot setting by adding a simple prompt, significantly improving their performance across various benchmarks without needing few-shot examples.
Contribution
Demonstrates that adding 'Let's think step by step' enables LLMs to excel at reasoning tasks in a zero-shot manner, establishing a strong baseline and revealing untapped capabilities.
Findings
Zero-shot-CoT improves accuracy on arithmetic tasks from 17.7% to 78.7%.
Significant performance gains on diverse reasoning benchmarks.
Simple prompting reveals broad cognitive abilities in LLMs.
Abstract
Pretrained large language models (LLMs) are widely used in many sub-fields of natural language processing (NLP) and generally known as excellent few-shot learners with task-specific exemplars. Notably, chain of thought (CoT) prompting, a recent technique for eliciting complex multi-step reasoning through step-by-step answer examples, achieved the state-of-the-art performances in arithmetics and symbolic reasoning, difficult system-2 tasks that do not follow the standard scaling laws for LLMs. While these successes are often attributed to LLMs' ability for few-shot learning, we show that LLMs are decent zero-shot reasoners by simply adding "Let's think step by step" before each answer. Experimental results demonstrate that our Zero-shot-CoT, using the same single prompt template, significantly outperforms zero-shot LLM performances on diverse benchmark reasoning tasks including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗SeaLLMs/SeaLLM-7B-v2model· 8.4k dl· ♡ 688.4k dl♡ 68
- 🤗LoneStriker/SeaLLM-7B-v2-GGUFmodel· 169 dl· ♡ 6169 dl♡ 6
- 🤗LoneStriker/SeaLLM-7B-v2-3.0bpw-h6-exl2model· 1 dl1 dl
- 🤗LoneStriker/SeaLLM-7B-v2-4.0bpw-h6-exl2model· 4 dl4 dl
- 🤗LoneStriker/SeaLLM-7B-v2-5.0bpw-h6-exl2model· 5 dl5 dl
- 🤗LoneStriker/SeaLLM-7B-v2-6.0bpw-h6-exl2model· 1 dl1 dl
- 🤗LoneStriker/SeaLLM-7B-v2-8.0bpw-h8-exl2model· 4 dl4 dl
- 🤗LoneStriker/SeaLLM-7B-v2-AWQmodel· 5 dl5 dl
- 🤗SeaLLMs/SeaLLM-7B-v2-ggufmodel· 53 dl· ♡ 953 dl♡ 9
- 🤗QuantFactory/SeaLLM-7B-v2-GGUFmodel· 121 dl· ♡ 1121 dl♡ 1
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsPathways Language Model
