Learning Bayesian Networks with Incomplete Data by Augmentation

Tameem Adel; Cassio P. de Campos

arXiv:1608.07734·cs.AI·December 6, 2016

Learning Bayesian Networks with Incomplete Data by Augmentation

Tameem Adel, Cassio P. de Campos

PDF

TL;DR

This paper introduces novel algorithms for learning Bayesian networks from incomplete data by transforming the problem into a standard learning task, including an exact method and a scalable approximate approach, validated through extensive experiments.

Contribution

It presents the first exact algorithm for Bayesian network learning with missing data and develops a scalable approximate method based on data augmentation.

Findings

01

Exact algorithm successfully recasts the problem into standard Bayesian network learning.

02

Approximate algorithm scales to large domains with suitable structure learning methods.

03

Experiments demonstrate the effectiveness of the new approach.

Abstract

We present new algorithms for learning Bayesian networks from data with missing values using a data augmentation approach. An exact Bayesian network learning algorithm is obtained by recasting the problem into a standard Bayesian network learning problem without missing data. To the best of our knowledge, this is the first exact algorithm for this problem. As expected, the exact algorithm does not scale to large domains. We build on the exact method to create an approximate algorithm using a hill-climbing technique. This algorithm scales to large domains so long as a suitable standard structure learning method for complete data is available. We perform a wide range of experiments to demonstrate the benefits of learning Bayesian networks with such new approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.