A Bayesian Hierarchical Score for Structure Learning from Related Data Sets
Laura Azzimonti, Giorgio Corani, Marco Scutari

TL;DR
This paper introduces a Bayesian Hierarchical Dirichlet score for structure learning in Bayesian networks that effectively leverages related data sets, outperforming traditional scores in accuracy and interpretability when data are heterogeneous.
Contribution
The paper proposes a novel Bayesian Hierarchical Dirichlet score that pools information across related data sets, accounting for their differences, and provides a closed-form approximation for efficient computation.
Findings
BHD outperforms BDeu in reconstruction accuracy with multiple data sets.
BHD produces sparser, more interpretable networks.
BHD performs as well as BDeu on homogeneous data.
Abstract
Score functions for learning the structure of Bayesian networks in the literature assume that data are a homogeneous set of observations; whereas it is often the case that they comprise different related, but not homogeneous, data sets collected in different ways. In this paper we propose a new Bayesian Dirichlet score, which we call Bayesian Hierarchical Dirichlet (BHD). The proposed score is based on a hierarchical model that pools information across data sets to learn a single encompassing network structure, while taking into account the differences in their probabilistic structures. We derive a closed-form expression for BHD using a variational approximation of the marginal likelihood, we study the associated computational cost and we evaluate its performance using simulated data. We find that, when data comprise multiple related data sets, BHD outperforms the Bayesian Dirichlet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Bayesian Methods and Mixture Models · Statistical Methods and Inference
