Unveiling and Causalizing CoT: A Causal Perspective

Jiarun Fu, Lizhong Ding, Hao Li, Pengqi Li, Qiuning Wei, and Xu Chen

arXiv:2502.18239·cs.LG·February 26, 2025

Open Access

TL;DR

This paper introduces a causal perspective to understand and improve Chain-of-Thought reasoning in large language models, making reasoning steps both correct and understandable by modeling causality with structural causal models.

Contribution

It unveils the causal mechanism of CoT, defines a causal effect measure, and proposes a causalization algorithm to correct errors and enhance interpretability.

Findings

01. Causal errors in reasoning steps are effectively corrected.
02. Reasoning ability of LLMs is significantly improved.
03. All reasoning steps become correct and understandable.

Abstract

Although Chain-of-Thought (CoT) has achieved remarkable success in enhancing the reasoning ability of large language models (LLMs), the mechanism of CoT remains a "black box". Even when correct answers are frequently obtained, existing CoTs struggle to make the reasoning understandable to humans. In this paper, we unveil and causalize CoT from a causal perspective to ensure both the correctness and the understandability of all reasoning steps; to the best of our knowledge, this is the first such work. We model the causality of CoT via structural causal models (SCMs) to unveil its reasoning mechanism. To measure the causality of CoT, we define the CoT Average Causal Effect (CACE) to test the causal relations between steps. For steps without causality (wrong or unintelligible steps), we design a role-playing causal query algorithm to causalize them, resulting in a causalized CoT with all…
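The abstract does not spell out how CACE is computed. As a rough point of reference only, the textbook average causal effect of a binary variable X on an outcome Y in a structural causal model is E[Y | do(X=1)] - E[Y | do(X=0)]. The sketch below estimates that generic quantity by simulation in a toy model. The function names, the structural equation, and the probabilities 0.9 and 0.3 are all assumptions made for illustration; none of them is taken from the paper, and this is not the paper's CACE definition.

```python
import random


def sample_outcome(x, rng):
    """Toy structural equation Y := f(X, U).

    Hypothetical reading: X = 1 means the previous reasoning step is kept,
    X = 0 means it is replaced, and Y = 1 means the next step is correct.
    The probabilities are assumed values for illustration only.
    """
    p_correct = 0.9 if x == 1 else 0.3
    return 1 if rng.random() < p_correct else 0


def average_causal_effect(n_samples=20000, seed=0):
    """Monte Carlo estimate of E[Y | do(X=1)] - E[Y | do(X=0)]."""
    rng = random.Random(seed)
    # Intervening on X means we set its value directly instead of sampling it.
    mean_do1 = sum(sample_outcome(1, rng) for _ in range(n_samples)) / n_samples
    mean_do0 = sum(sample_outcome(0, rng) for _ in range(n_samples)) / n_samples
    return mean_do1 - mean_do0


if __name__ == "__main__":
    print(f"Estimated average causal effect: {average_causal_effect():.3f}")
```

In this toy setting the estimate should land close to 0.9 - 0.3 = 0.6; a value near zero would instead suggest that intervening on X has little effect on Y.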

Taxonomy

Topics: Quantum Mechanics and Applications · Advanced Text Analysis Techniques · Online Learning and Analytics