Unveiling Confirmation Bias in Chain-of-Thought Reasoning
Yue Wan, Xiaowei Jia, Xiang Lorraine Li

TL;DR
This paper investigates confirmation bias in chain-of-thought reasoning of large language models, revealing how internal beliefs influence reasoning and answer prediction, and highlighting the need for strategies to mitigate bias for improved performance.
Contribution
It introduces a cognitive psychology perspective to analyze confirmation bias in LLMs' reasoning, providing empirical evidence and insights into how beliefs affect reasoning and answer prediction.
Findings
Confirmation bias affects reasoning generation and answer prediction in LLMs.
Model beliefs influence the utilization of rationales in reasoning tasks.
Task vulnerability to bias correlates with reasoning effectiveness across models.
Abstract
Chain-of-thought (CoT) prompting has been widely adopted to enhance the reasoning capabilities of large language models (LLMs). However, the effectiveness of CoT reasoning is inconsistent across tasks with different reasoning types. This work presents a novel perspective to understand CoT behavior through the lens of \textit{confirmation bias} in cognitive psychology. Specifically, we examine how model internal beliefs, approximated by direct question-answering probabilities, affect both reasoning generation () and reasoning-guided answer prediction () in CoT. By decomposing CoT into a two-stage process, we conduct a thorough correlation analysis in model beliefs, rationale attributes, and stage-wise performance. Our results provide strong evidence of confirmation bias in LLMs, such that model beliefs not only skew the reasoning process but also influence how…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Semantic Web and Ontologies · Logic, Reasoning, and Knowledge
