Break the Chain: Large Language Models Can be Shortcut Reasoners

Mengru Ding; Hanmeng Liu; Zhizhang Fu; Jian Song; Wenbo Xie; Yue Zhang

arXiv:2406.06580·cs.CL·June 12, 2024

Break the Chain: Large Language Models Can be Shortcut Reasoners

Mengru Ding, Hanmeng Liu, Zhizhang Fu, Jian Song, Wenbo Xie, Yue Zhang

PDF

Open Access

TL;DR

This paper introduces 'break the chain' strategies that enable large language models to use shortcuts for reasoning, reducing complexity and improving efficiency in complex logical and commonsense tasks.

Contribution

It proposes novel zero-shot prompting methods and a new dataset, ShortcutQA, to evaluate shortcut reasoning in language models, advancing beyond traditional chain-of-thought approaches.

Findings

01

Models effectively use shortcuts with 'break the chain' strategies

02

ShortcutQA challenges models' reasoning abilities

03

Shortcuts improve reasoning speed and efficiency

Abstract

Recent advancements in Chain-of-Thought (CoT) reasoning utilize complex modules but are hampered by high token consumption, limited applicability, and challenges in reproducibility. This paper conducts a critical evaluation of CoT prompting, extending beyond arithmetic to include complex logical and commonsense reasoning tasks, areas where standard CoT methods fall short. We propose the integration of human-like heuristics and shortcuts into language models (LMs) through "break the chain" strategies. These strategies disrupt traditional CoT processes using controlled variables to assess their efficacy. Additionally, we develop innovative zero-shot prompting strategies that encourage the use of shortcuts, enabling LMs to quickly exploit reasoning clues and bypass detailed procedural steps. Our comprehensive experiments across various LMs, both commercial and open-source, reveal that LMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques