VertiBayes: Learning Bayesian network parameters from vertically partitioned data with missing values
Florian van Daalen, Lianne Ippel, Andre Dekker, Inigo Bermejo

TL;DR
VertiBayes introduces a novel federated learning method for Bayesian networks that effectively learns from vertically partitioned data with missing values, ensuring privacy and model accuracy.
Contribution
The paper presents VertiBayes, a new approach for learning Bayesian network parameters and structure from vertically partitioned data with missing values, incorporating privacy-preserving protocols.
Findings
Models are comparable to traditional algorithms in accuracy.
The approach handles missing data effectively.
Estimated complexity increases with data and network size.
Abstract
Federated learning makes it possible to train a machine learning model on decentralized data. Bayesian networks are probabilistic graphical models that have been widely used in artificial intelligence applications. Their popularity stems from the fact they can be built by combining existing expert knowledge with data and are highly interpretable, which makes them useful for decision support, e.g. in healthcare. While some research has been published on the federated learning of Bayesian networks, publications on Bayesian networks in a vertically partitioned or heterogeneous data setting (where different variables are located in different datasets) are limited, and suffer from important omissions, such as the handling of missing data. In this article, we propose a novel method called VertiBayes to train Bayesian networks (structure and parameters) on vertically partitioned data, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Data Quality and Management · Privacy-Preserving Technologies in Data
