LLM-initialized Differentiable Causal Discovery
Shiv Kampani, David Hidary, Constantijn van der Poel, Martin Ganahl,, Brenda Miao

TL;DR
This paper introduces LLM-DCD, a novel causal discovery method that leverages large language models for initialization, improving interpretability and accuracy in uncovering causal relationships from observational data.
Contribution
The paper proposes a new approach that integrates LLMs with differentiable causal discovery by using LLMs for initialization, enhancing interpretability and performance.
Findings
Higher accuracy on benchmark datasets compared to state-of-the-art methods.
Initialization quality significantly affects final causal discovery results.
Explicit adjacency matrix optimization improves interpretability.
Abstract
The discovery of causal relationships between random variables is an important yet challenging problem that has applications across many scientific domains. Differentiable causal discovery (DCD) methods are effective in uncovering causal relationships from observational data; however, these approaches often suffer from limited interpretability and face challenges in incorporating domain-specific prior knowledge. In contrast, Large Language Models (LLMs)-based causal discovery approaches have recently been shown capable of providing useful priors for causal discovery but struggle with formal causal reasoning. In this paper, we propose LLM-DCD, which uses an LLM to initialize the optimization of the maximum likelihood objective function of DCD approaches, thereby incorporating strong priors into the discovery method. To achieve this initialization, we design our objective function to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Bayesian Modeling and Causal Inference · Fault Detection and Control Systems
