Better Simulations for Validating Causal Discovery with the DAG-Adaptation of the Onion Method
Bryan Andrews, Erich Kummerfeld

TL;DR
This paper introduces the DAG-adaptation of the Onion (DaO) simulation method, which generates more realistic linear causal models by focusing on correlation matrices, aiming to standardize and improve validation of causal discovery algorithms.
Contribution
The paper presents a novel simulation design for causal discovery validation that emphasizes correlation matrix distribution, along with methods for sampling DAGs with specific degree distributions.
Findings
DaO produces more realistic causal models.
Compared to existing methods, DaO offers more consistent validation benchmarks.
Provides open-source implementations in Python and R.
Abstract
The number of artificial intelligence algorithms for learning causal models from data is growing rapidly. Most ``causal discovery'' or ``causal structure learning'' algorithms are primarily validated through simulation studies. However, no widely accepted simulation standards exist and publications often report conflicting performance statistics -- even when only considering publications that simulate data from linear models. In response, several manuscripts have criticized a popular simulation design for validating algorithms in the linear case. We propose a new simulation design for generating linear models for directed acyclic graphs (DAGs): the DAG-adaptation of the Onion (DaO) method. DaO simulations are fundamentally different from existing simulations because they prioritize the distribution of correlation matrices rather than the distribution of linear effects. Specifically,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Risk and Safety Analysis · Bayesian Modeling and Causal Inference
