CARE: Turning LLMs Into Causal Reasoning Expert
Juncheng Dong, Yiling Liu, Ahmed Aloui, Vahid Tarokh, David Carlson

TL;DR
This paper introduces CARE, a framework that fine-tunes LLMs to improve their causal reasoning by effectively integrating outputs from established causal discovery algorithms, surpassing existing methods in accuracy.
Contribution
The paper presents a novel fine-tuning approach that enhances LLMs' causal reasoning by teaching them to utilize external causal discovery outputs, addressing their inherent limitations.
Findings
Fine-tuned Qwen2.5-1.5B outperforms larger models and traditional algorithms.
Prompting with causal discovery outputs initially decreases LLM performance.
CARE significantly improves LLMs' ability to perform causal reasoning.
Abstract
Large language models (LLMs) have recently demonstrated impressive capabilities across a range of reasoning and generation tasks. However, research studies have shown that LLMs lack the ability to identify causal relationships, a fundamental cornerstone of human intelligence. We first conduct an exploratory investigation of LLMs' behavior when asked to perform a causal-discovery task and find that they mostly rely on the semantic meaning of variable names, ignoring the observation data. This is unsurprising, given that LLMs were never trained to process structural datasets. To first tackle this challenge, we prompt the LLMs with the outputs of established causal discovery algorithms designed for observational datasets. These algorithm outputs effectively serve as the sufficient statistics of the observation data. However, quite surprisingly, we find that prompting the LLMs with these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications
