Reliable and Efficient Inference of Bayesian Networks from Sparse Data   by Statistical Learning Theory

Dominik Janzing; Daniel Herrmann

arXiv:cs/0309015·cs.LG·May 23, 2007

Reliable and Efficient Inference of Bayesian Networks from Sparse Data by Statistical Learning Theory

Dominik Janzing, Daniel Herrmann

PDF

Open Access

TL;DR

This paper demonstrates that Bayesian networks with small in-degree can be reliably inferred from sparse data using VC dimension bounds, enabling polynomial-time search and convex optimization for model selection.

Contribution

It provides a theoretical framework for learning simple Bayesian networks from limited data by bounding VC dimension and applying structural risk minimization.

Findings

01

VC dimension bounds enable reliable inference from sparse data

02

Polynomial complexity for searching optimal Bayesian networks with fixed in-degree

03

Convex optimization can find the optimal joint measure for a given graph

Abstract

To learn (statistical) dependencies among random variables requires exponentially large sample size in the number of observed random variables if any arbitrary joint probability distribution can occur. We consider the case that sparse data strongly suggest that the probabilities can be described by a simple Bayesian network, i.e., by a graph with small in-degree \Delta. Then this simple law will also explain further data with high confidence. This is shown by calculating bounds on the VC dimension of the set of those probability measures that correspond to simple graphs. This allows to select networks by structural risk minimization and gives reliability bounds on the error of the estimated joint measure without (in contrast to a previous paper) any prior assumptions on the set of possible joint measures. The complexity for searching the optimal Bayesian networks of in-degree \Delta…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · Machine Learning and Data Classification