Causal discovery for observational sciences using supervised machine learning
Anne Helby Petersen, Joseph Ramsey, Claus Thorn Ekstr{\o}m, Peter, Spirtes

TL;DR
This paper introduces SLdisco, a supervised machine learning method for causal discovery from observational data that improves accuracy, conservativeness, and performance on small samples compared to existing algorithms.
Contribution
SLdisco is a novel supervised learning approach that addresses limitations of existing causal discovery methods, especially in small samples and complex models.
Findings
SLdisco is more conservative and less sensitive to sample size.
It performs better on small datasets in both simulations and real data.
SLdisco maintains a good balance between informativeness and accuracy.
Abstract
Causal inference can estimate causal effects, but unless data are collected experimentally, statistical analyses must rely on pre-specified causal models. Causal discovery algorithms are empirical methods for constructing such causal models from data. Several asymptotically correct methods already exist, but they generally struggle on smaller samples. Moreover, most methods focus on very sparse causal models, which may not always be a realistic representation of real-life data generating mechanisms. Finally, while causal relationships suggested by the methods often hold true, their claims about causal non-relatedness have high error rates. This non-conservative error tradeoff is not ideal for observational sciences, where the resulting model is directly used to inform causal inference: A causal model with many missing causal relations entails too strong assumptions and may lead to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Advanced Causal Inference Techniques · Statistical Methods and Bayesian Inference
