Challenges and Considerations in the Evaluation of Bayesian Causal Discovery
Amir Mohammad Karimi Mamaghan, Panagiotis Tigas, Karl Henrik, Johansson, Yarin Gal, Yashas Annadani, Stefan Bauer

TL;DR
This paper critically examines the evaluation metrics for Bayesian Causal Discovery, revealing their limitations in accurately assessing posterior quality, especially with low data or high uncertainty, and emphasizes the need for improved evaluation methods.
Contribution
The study provides an extensive empirical analysis of existing metrics for BCD evaluation, highlighting their shortcomings and proposing considerations for developing better assessment tools.
Findings
Many metrics do not correlate well with true posterior quality.
Current metrics are less effective with high entropy posteriors.
Evaluation challenges are influenced by data size and model identifiability.
Abstract
Representing uncertainty in causal discovery is a crucial component for experimental design, and more broadly, for safe and reliable causal decision making. Bayesian Causal Discovery (BCD) offers a principled approach to encapsulating this uncertainty. Unlike non-Bayesian causal discovery, which relies on a single estimated causal graph and model parameters for assessment, evaluating BCD presents challenges due to the nature of its inferred quantity - the posterior distribution. As a result, the research community has proposed various metrics to assess the quality of the approximate posterior. However, there is, to date, no consensus on the most suitable metric(s) for evaluation. In this work, we reexamine this question by dissecting various metrics and understanding their limitations. Through extensive empirical evaluation, we find that many existing metrics fail to exhibit a strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference
