$\text{C}^2\text{P}$: Featuring Large Language Models with Causal Reasoning
Abdolmahdi Bagheri, Matin Alinejad, Mahdi Dehshiri, Kevin Bello, Alireza Akhondi-Asl

TL;DR
The paper introduces Causal Chain of Prompting (C^2P), a novel framework that enhances large language models' causal reasoning abilities without external tools, significantly improving their accuracy on synthetic and real-world datasets.
Contribution
C^2P is the first autonomous framework for causal reasoning in LLMs, improving reasoning accuracy by over 30% and demonstrating effectiveness in diverse real-world scenarios.
Findings
Reasoning accuracy improved by over 30% on synthetic datasets.
Few-shot learning with C^2P increased accuracy by over 20%.
Enhanced reasoning performance in real-world story datasets.
Abstract
Causal reasoning is one of the primary bottlenecks that Large Language Models (LLMs) must overcome to attain human-level intelligence. Recent studies indicate that LLMs display near-random performance on reasoning tasks. To address this, we introduce the Causal Chain of Prompting (), a reasoning framework that aims to equip current LLMs with causal reasoning capabilities as the first framework of its kind operating autonomously without relying on external tools or modules during both the causal learning and reasoning phases. To evaluate the performance of , we first demonstrate that reasoning accuracy improved by over and for GPT-4 Turbo and LLaMA 3.1, respectively, when using our framework, compared to the same models without on a synthetic benchmark dataset. Then, using few-shot learning of the same LLMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Data Quality and Management
