Leveraging Large Language Models for Causal Discovery: a Constraint-based, Argumentation-driven Approach

Zihao Li; Fabrizio Russo

arXiv:2602.16481·cs.AI·February 19, 2026

Leveraging Large Language Models for Causal Discovery: a Constraint-based, Argumentation-driven Approach

Zihao Li, Fabrizio Russo

PDF

Open Access

TL;DR

This paper introduces a novel approach that combines large language models with causal assumption-based argumentation to improve causal discovery, achieving state-of-the-art results by integrating semantic priors and independence evidence.

Contribution

It presents a new method leveraging LLMs as imperfect experts within a symbolic reasoning framework for causal discovery, enhancing existing techniques with semantic priors.

Findings

01

Achieved state-of-the-art performance on benchmark datasets.

02

Effectively integrated semantic priors from LLMs with independence evidence.

03

Proposed an evaluation protocol to reduce memorisation bias in LLM-based causal discovery.

Abstract

Causal discovery seeks to uncover causal relations from data, typically represented as causal graphs, and is essential for predicting the effects of interventions. While expert knowledge is required to construct principled causal graphs, many statistical methods have been proposed to leverage observational data with varying formal guarantees. Causal Assumption-based Argumentation (ABA) is a framework that uses symbolic reasoning to ensure correspondence between input constraints and output graphs, while offering a principled way to combine data and expertise. We explore the use of large language models (LLMs) as imperfect experts for Causal ABA, eliciting semantic structural priors from variable names and descriptions and integrating them with conditional-independence evidence. Experiments on standard benchmarks and semantically grounded synthetic graphs demonstrate state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Topic Modeling · Explainable Artificial Intelligence (XAI)