Better Simulations for Validating Causal Discovery with the   DAG-Adaptation of the Onion Method

Bryan Andrews; Erich Kummerfeld

arXiv:2405.13100·stat.ME·May 24, 2024

Better Simulations for Validating Causal Discovery with the DAG-Adaptation of the Onion Method

Bryan Andrews, Erich Kummerfeld

PDF

Open Access 1 Repo

TL;DR

This paper introduces the DAG-adaptation of the Onion (DaO) simulation method, which generates more realistic linear causal models by focusing on correlation matrices, aiming to standardize and improve validation of causal discovery algorithms.

Contribution

The paper presents a novel simulation design for causal discovery validation that emphasizes correlation matrix distribution, along with methods for sampling DAGs with specific degree distributions.

Findings

01

DaO produces more realistic causal models.

02

Compared to existing methods, DaO offers more consistent validation benchmarks.

03

Provides open-source implementations in Python and R.

Abstract

The number of artificial intelligence algorithms for learning causal models from data is growing rapidly. Most ``causal discovery'' or ``causal structure learning'' algorithms are primarily validated through simulation studies. However, no widely accepted simulation standards exist and publications often report conflicting performance statistics -- even when only considering publications that simulate data from linear models. In response, several manuscripts have criticized a popular simulation design for validating algorithms in the linear case. We propose a new simulation design for generating linear models for directed acyclic graphs (DAGs): the DAG-adaptation of the Onion (DaO) method. DaO simulations are fundamentally different from existing simulations because they prioritize the distribution of correlation matrices rather than the distribution of linear effects. Specifically,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bja43/dao_simulation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Risk and Safety Analysis · Bayesian Modeling and Causal Inference