Causal Discovery with Language Models as Imperfect Experts

Stephanie Long; Alexandre Pich\'e; Valentina Zantedeschi; Tibor; Schuster; Alexandre Drouin

arXiv:2307.02390·cs.AI·July 6, 2023·6 cites

Causal Discovery with Language Models as Imperfect Experts

Stephanie Long, Alexandre Pich\'e, Valentina Zantedeschi, Tibor, Schuster, Alexandre Drouin

PDF

Open Access 1 Repo

TL;DR

This paper investigates how to enhance causal discovery by integrating potentially imperfect expert knowledge, including language models, to improve causal graph identification beyond traditional methods.

Contribution

It introduces strategies for correcting expert-provided causal orientations using consistency checks and demonstrates this approach with a large language model as an imperfect expert.

Findings

01

Language models can serve as imperfect experts for causal discovery.

02

Consistency-based methods improve causal graph accuracy.

03

Real data case study validates the approach.

Abstract

Understanding the causal relationships that underlie a system is a fundamental prerequisite to accurate decision-making. In this work, we explore how expert knowledge can be used to improve the data-driven identification of causal graphs, beyond Markov equivalence classes. In doing so, we consider a setting where we can query an expert about the orientation of causal relationships between variables, but where the expert may provide erroneous information. We propose strategies for amending such expert knowledge based on consistency properties, e.g., acyclicity and conditional independencies in the equivalence class. We then report a case study, on real data, where a large language model is used as an imperfect expert.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

stephlong614/causal-disco
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Topic Modeling · Advanced Graph Neural Networks