Bayesian sample size determination for causal discovery

Federico Castelletti; Guido Consonni

arXiv:2206.00755·stat.ME·June 3, 2022

Bayesian sample size determination for causal discovery

Federico Castelletti, Guido Consonni

PDF

Open Access

TL;DR

This paper introduces a Bayesian approach to determine the optimal sample size for interventional experiments in causal discovery, improving the efficiency and accuracy of identifying causal DAGs from observational and interventional data.

Contribution

It proposes a novel Bayesian experimental design method to calculate the necessary sample size for interventions in causal graph learning, addressing a key gap in existing algorithms.

Findings

01

Method effectively estimates sample sizes for interventions.

02

Enhances causal discovery accuracy with fewer resources.

03

Provides a pre-experimental evaluation framework.

Abstract

Graphical models based on Directed Acyclic Graphs (DAGs) are widely used to answer causal questions across a variety of scientific and social disciplines. However, observational data alone cannot distinguish in general between DAGs representing the same conditional independence assertions (Markov equivalent DAGs); as a consequence the orientation of some edges in the graph remains indeterminate. Interventional data, produced by exogenous manipulations of variables in the network, enhance the process of structure learning because they allow to distinguish among equivalent DAGs, thus sharpening causal inference. Starting from an equivalence class of DAGs, a few procedures have been devised to produce a collection of variables to be manipulated in order to identify a causal DAG. Yet, these algorithmic approaches do not determine the sample size of the interventional data required to obtain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Bioinformatics and Genomic Networks