Automatic Chain of Thought Prompting in Large Language Models
Zhuosheng Zhang, Aston Zhang, Mu Li, Alex Smola

TL;DR
Auto-CoT automatically generates diverse reasoning demonstrations for large language models, eliminating manual effort and achieving performance comparable or superior to manually crafted prompts across multiple reasoning benchmarks.
Contribution
The paper introduces Auto-CoT, an automatic method for generating reasoning demonstrations that leverages diversity sampling, reducing manual effort in chain-of-thought prompting.
Findings
Auto-CoT matches or exceeds manual CoT performance on ten benchmarks.
Diversity sampling improves reasoning chain quality.
Auto-CoT reduces manual prompt engineering effort.
Abstract
Large language models (LLMs) can perform complex reasoning by generating intermediate reasoning steps. Providing these steps for prompting demonstrations is called chain-of-thought (CoT) prompting. CoT prompting has two major paradigms. One leverages a simple prompt like "Let's think step by step" to facilitate step-by-step thinking before answering a question. The other uses a few manual demonstrations one by one, each composed of a question and a reasoning chain that leads to an answer. The superior performance of the second paradigm hinges on the hand-crafting of task-specific demonstrations one by one. We show that such manual efforts may be eliminated by leveraging LLMs with the "Let's think step by step" prompt to generate reasoning chains for demonstrations one by one, i.e., let's think not just step by step, but also one by one. However, these generated chains often come with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Chain-of-thought prompting · Linear Layer · Byte Pair Encoding · Layer Normalization · Cosine Annealing · Residual Connection · Dropout · Weight Decay
