Integrating Large Language Models in Causal Discovery: A Statistical Causal Approach

Masayuki Takayama; Tadahisa Okuda; Thong Pham; Tatsuyoshi Ikenoue; Shingo Fukuma; Shohei Shimizu; Akiyoshi Sannai

arXiv:2402.01454·cs.LG·May 13, 2025·6 cites

Integrating Large Language Models in Causal Discovery: A Statistical Causal Approach

Masayuki Takayama, Tadahisa Okuda, Thong Pham, Tatsuyoshi Ikenoue, Shingo Fukuma, Shohei Shimizu, Akiyoshi Sannai

PDF

Open Access 2 Repos

TL;DR

This paper introduces a novel method combining large language models with statistical causal discovery to improve causal inference, demonstrating enhanced accuracy and potential for diverse scientific applications.

Contribution

It proposes a new approach called statistical causal prompting (SCP) that synthesizes LLM-based knowledge with traditional causal discovery methods, improving results.

Findings

01

LLM-KBCI and augmented SCD approach ground truths better

02

SCD results improve with SCP application

03

Background knowledge from LLM enhances SCD on unseen datasets

Abstract

In practical statistical causal discovery (SCD), embedding domain expert knowledge as constraints into the algorithm is important for reasonable causal models reflecting the broad knowledge of domain experts, despite the challenges in the systematic acquisition of background knowledge. To overcome these challenges, this paper proposes a novel method for causal inference, in which SCD and knowledge-based causal inference (KBCI) with a large language model (LLM) are synthesized through ``statistical causal prompting (SCP)'' for LLMs and prior knowledge augmentation for SCD. The experiments in this work have revealed that the results of LLM-KBCI and SCD augmented with LLM-KBCI approach the ground truths, more than the SCD result without prior knowledge. These experiments have also revealed that the SCD result can be further improved if the LLM undergoes SCP. Furthermore, with an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Bayesian Modeling and Causal Inference

MethodsAttention Is All You Need · Layer Normalization · Absolute Position Encodings · Linear Layer · Byte Pair Encoding · Multi-Head Attention · Residual Connection · Dense Connections · Position-Wise Feed-Forward Layer · Label Smoothing