How Likely Do LLMs with CoT Mimic Human Reasoning?

Guangsheng Bao; Hongbo Zhang; Cunxiang Wang; Linyi Yang; Yue Zhang

arXiv:2402.16048·cs.CL·December 13, 2024·3 cites

How Likely Do LLMs with CoT Mimic Human Reasoning?

Guangsheng Bao, Hongbo Zhang, Cunxiang Wang, Linyi Yang, Yue Zhang

PDF

Open Access 1 Repo

TL;DR

This paper investigates how chain-of-thought prompting affects LLM reasoning by analyzing the causal structure of their reasoning process compared to humans, revealing deviations and factors influencing reasoning quality.

Contribution

It introduces a causal analysis framework to diagnose LLM reasoning, highlighting how different training techniques impact reasoning structure and identifying limitations of model size increases.

Findings

01

LLMs often deviate from ideal causal reasoning chains

02

In-context learning strengthens causal structure

03

Fine-tuning and reinforcement learning weaken causal structure

Abstract

Chain-of-thought emerges as a promising technique for eliciting reasoning capabilities from Large Language Models (LLMs). However, it does not always improve task performance or accurately represent reasoning processes, leaving unresolved questions about its usage. In this paper, we diagnose the underlying mechanism by comparing the reasoning process of LLMs with humans, using causal analysis to understand the relationships between the problem instruction, reasoning, and the answer in LLMs. Our empirical study reveals that LLMs often deviate from the ideal causal chain, resulting in spurious correlations and potential consistency errors (inconsistent reasoning and answers). We also examine various factors influencing the causal structure, finding that in-context learning with examples strengthens it, while post-training techniques like supervised fine-tuning and reinforcement learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stevenzhb/cot_causal_analysis
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuction Theory and Applications · Multi-Agent Systems and Negotiation · Semantic Web and Ontologies