First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning
Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Shusaku Sone, Masaya, Taniguchi, Keisuke Sakaguchi, Kentaro Inui

TL;DR
This paper investigates how language models dynamically switch between heuristic and rational strategies during multi-step reasoning, revealing a shift from heuristic reliance early on to more rational processing near the conclusion.
Contribution
It uncovers the dynamic use of heuristics versus rational strategies in language model reasoning, highlighting the limited backtracking ability and adaptive strategy combination.
Findings
LMs rely more on heuristics early in reasoning
Reliance on heuristics decreases as reasoning progresses
Limited backtracking capability in multi-step reasoning
Abstract
Multi-step reasoning instruction, such as chain-of-thought prompting, is widely adopted to explore better language models (LMs) performance. We report on the systematic strategy that LMs employ in such a multi-step reasoning process. Our controlled experiments reveal that LMs rely more heavily on heuristics, such as lexical overlap, in the earlier stages of reasoning, where more reasoning steps remain to reach a goal. Conversely, their reliance on heuristics decreases as LMs progress closer to the final answer through multiple reasoning steps. This suggests that LMs can backtrack only a limited number of future steps and dynamically combine heuristic strategies with rationale ones in tasks involving multi-step reasoning.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Multi-Agent Systems and Negotiation · Speech and dialogue systems
