Counterfactual Causal Inference in Natural Language with Large Language Models
Ga\"el Gendron, Jo\v{z}e M. Ro\v{z}anec, Michael Witbrock, Gillian, Dobbie

TL;DR
This paper explores using Large Language Models to extract causal graphs from natural language texts and perform counterfactual causal inference, addressing challenges of unstructured data and proposing an end-to-end methodology.
Contribution
It introduces a novel end-to-end method for causal structure discovery and counterfactual inference from natural language using LLMs, including graph merging and bias reduction techniques.
Findings
LLMs can extract causal variables from text for graph construction
Merging graphs from multiple sources improves causal representation
Counterfactual inference performance is limited by prediction errors
Abstract
Causal structure discovery methods are commonly applied to structured data where the causal variables are known and where statistical testing can be used to assess the causal relationships. By contrast, recovering a causal structure from unstructured natural language data such as news articles contains numerous challenges due to the absence of known variables or counterfactual data to estimate the causal links. Large Language Models (LLMs) have shown promising results in this direction but also exhibit limitations. This work investigates LLM's abilities to build causal graphs from text documents and perform counterfactual causal inference. We propose an end-to-end causal structure discovery and causal inference method from natural language: we first use an LLM to extract the instantiated causal variables from text data and build a causal graph. We merge causal graphs from multiple data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsSparse Evolutionary Training · Causal inference
