Reliable and Efficient Inference of Bayesian Networks from Sparse Data by Statistical Learning Theory
Dominik Janzing, Daniel Herrmann

TL;DR
This paper demonstrates that Bayesian networks with small in-degree can be reliably inferred from sparse data using VC dimension bounds, enabling polynomial-time search and convex optimization for model selection.
Contribution
It provides a theoretical framework for learning simple Bayesian networks from limited data by bounding VC dimension and applying structural risk minimization.
Findings
VC dimension bounds enable reliable inference from sparse data
Polynomial complexity for searching optimal Bayesian networks with fixed in-degree
Convex optimization can find the optimal joint measure for a given graph
Abstract
To learn (statistical) dependencies among random variables requires exponentially large sample size in the number of observed random variables if any arbitrary joint probability distribution can occur. We consider the case that sparse data strongly suggest that the probabilities can be described by a simple Bayesian network, i.e., by a graph with small in-degree \Delta. Then this simple law will also explain further data with high confidence. This is shown by calculating bounds on the VC dimension of the set of those probability measures that correspond to simple graphs. This allows to select networks by structural risk minimization and gives reliability bounds on the error of the estimated joint measure without (in contrast to a previous paper) any prior assumptions on the set of possible joint measures. The complexity for searching the optimal Bayesian networks of in-degree \Delta…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · Machine Learning and Data Classification
