$\text{C}^2\text{P}$: Featuring Large Language Models with Causal Reasoning

Abdolmahdi Bagheri; Matin Alinejad; Mahdi Dehshiri; Kevin Bello; Alireza Akhondi-Asl

arXiv:2407.18069·cs.LO·November 10, 2025·3 cites

$\text{C}^2\text{P}$: Featuring Large Language Models with Causal Reasoning

Abdolmahdi Bagheri, Matin Alinejad, Mahdi Dehshiri, Kevin Bello, Alireza Akhondi-Asl

PDF

Open Access

TL;DR

The paper introduces Causal Chain of Prompting (C^2P), a novel framework that enhances large language models' causal reasoning abilities without external tools, significantly improving their accuracy on synthetic and real-world datasets.

Contribution

C^2P is the first autonomous framework for causal reasoning in LLMs, improving reasoning accuracy by over 30% and demonstrating effectiveness in diverse real-world scenarios.

Findings

01

Reasoning accuracy improved by over 30% on synthetic datasets.

02

Few-shot learning with C^2P increased accuracy by over 20%.

03

Enhanced reasoning performance in real-world story datasets.

Abstract

Causal reasoning is one of the primary bottlenecks that Large Language Models (LLMs) must overcome to attain human-level intelligence. Recent studies indicate that LLMs display near-random performance on reasoning tasks. To address this, we introduce the Causal Chain of Prompting ( $C^{2} P$ ), a reasoning framework that aims to equip current LLMs with causal reasoning capabilities as the first framework of its kind operating autonomously without relying on external tools or modules during both the causal learning and reasoning phases. To evaluate the performance of $C^{2} P$ , we first demonstrate that reasoning accuracy improved by over $30.7%$ and $25.9%$ for GPT-4 Turbo and LLaMA 3.1, respectively, when using our framework, compared to the same models without $C^{2} P$ on a synthetic benchmark dataset. Then, using few-shot learning of the same LLMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Data Quality and Management