Chain-of-Thought Augmentation with Logit Contrast for Enhanced Reasoning in Language Models

Jay Shim; Grant Kruttschnitt; Alyssa Ma; Daniel Kim; Benjamin Chek; Athul Anand; Kevin Zhu; Sean O'Brien

arXiv:2407.03600·cs.CL·August 28, 2024·3 cites

Open Access

TL;DR

This paper introduces logit contrast techniques to augment chain-of-thought prompting, aiming to improve reasoning and compositional generalization in language models, though further validation across datasets is needed.

Contribution

It proposes input-based contrasting methods inspired by context-aware decoding to enhance reasoning capabilities in language models using chain-of-thought prompting.

Findings

01. Initial improvements in reasoning performance.

02. Potential for better compositional generalization.

03. Further work needed for stability across datasets.

Abstract

Rapidly increasing model scales coupled with steering methods such as chain-of-thought prompting have led to drastic improvements in language model reasoning. At the same time, models struggle with compositional generalization and are far from human performance on many reasoning-based benchmarks. Leveraging the success of chain-of-thought prompting, and also taking inspiration from context-aware decoding (CAD), we explore input-based contrasting methods to further encourage the type of reasoning induced by chain-of-thought prompting. While work remains to stabilize these results across datasets and models, the improvements we find warrant further investigation into input-based steering methods for context-aware reasoning.
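
The abstract does not spell out the contrast formula, so the following is only a minimal sketch of what an input-based logit contrast could look like, assuming the CAD-style form in which logits from the context-bearing input are amplified and logits from the context-free input are subtracted. Here the "context" is taken to be the chain-of-thought prompt. The function names, the `alpha` weight, and the toy numbers are all hypothetical and are not taken from the paper.

```python
import math


def contrast_logits(logits_with_cot, logits_without_cot, alpha=0.5):
    """CAD-style contrast (assumed form): (1 + alpha) * with - alpha * without.

    Both arguments are per-token next-token logits over the same vocabulary.
    alpha = 0 recovers ordinary decoding from the CoT-prompted input.
    """
    if len(logits_with_cot) != len(logits_without_cot):
        raise ValueError("logit vectors must cover the same vocabulary")
    return [
        (1 + alpha) * w - alpha * wo
        for w, wo in zip(logits_with_cot, logits_without_cot)
    ]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Toy three-token vocabulary with hypothetical logits.
# Only the second token is scored differently once the CoT prompt is present.
with_cot = [2.0, 1.0, 0.5]
without_cot = [2.0, 0.0, 0.5]

adjusted = contrast_logits(with_cot, without_cot, alpha=0.5)
probs = softmax(adjusted)
```

In this toy case the contrast raises only the logit of the token whose score depends on the chain-of-thought prompt, while tokens that both inputs score identically are left unchanged. In practice the two logit vectors would come from two forward passes of the same model at each decoding step.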

Taxonomy

Topics: Opinion Dynamics and Social Influence