Contrastive Chain-of-Thought Prompting
Yew Ken Chia, Guizhen Chen, Luu Anh Tuan, Soujanya Poria, Lidong Bing

TL;DR
This paper introduces contrastive chain-of-thought prompting, which uses both correct and incorrect reasoning examples to improve language model reasoning and reduce errors, demonstrating broad effectiveness across benchmarks.
Contribution
The paper proposes a novel contrastive prompting method that incorporates invalid reasoning demonstrations to enhance reasoning accuracy and generalization in language models.
Findings
Improves reasoning accuracy across benchmarks
Reduces reasoning mistakes by guiding models away from errors
Enhances generalization of chain-of-thought prompting
Abstract
Despite the success of chain of thought in enhancing language model reasoning, the underlying process remains less well understood. Although logically sound reasoning appears inherently crucial for chain of thought, prior studies surprisingly reveal minimal impact when using invalid demonstrations instead. Furthermore, the conventional chain of thought does not inform language models on what mistakes to avoid, which potentially leads to more errors. Hence, inspired by how humans can learn from both positive and negative examples, we propose contrastive chain of thought to enhance language model reasoning. Compared to the conventional chain of thought, our approach provides both valid and invalid reasoning demonstrations, to guide the model to reason step-by-step while reducing reasoning mistakes. To improve generalization, we introduce an automatic method to construct contrastive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
