Inversion of Bayesian Networks

Jesse van Oostrum; Peter van Hintum; Nihat Ay

arXiv:2212.10649·cs.LG·November 3, 2023

Inversion of Bayesian Networks

Jesse van Oostrum, Peter van Hintum, Nihat Ay

PDF

Open Access

TL;DR

This paper investigates the conditions under which recognition networks can exactly model true posterior distributions in Bayesian networks, providing both global and local criteria based on probabilistic graphical modeling principles.

Contribution

It establishes necessary and sufficient conditions for recognition networks to perfectly approximate posteriors in Bayesian networks, including new insights into local properties like perfectness.

Findings

01

Global conditions based on d-separation are identified.

02

Local conditions involve the property of perfectness at nodes.

03

Results clarify when recognition networks can exactly model true posteriors.

Abstract

Variational autoencoders and Helmholtz machines use a recognition network (encoder) to approximate the posterior distribution of a generative model (decoder). In this paper we study the necessary and sufficient properties of a recognition network so that it can model the true posterior distribution exactly. These results are derived in the general context of probabilistic graphical modelling / Bayesian networks, for which the network represents a set of conditional independence statements. We derive both global conditions, in terms of d-separation, and local conditions for the recognition network to have the desired qualities. It turns out that for the local conditions the property perfectness (for every node, all parents are joined) plays an important role.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Bayesian Methods and Mixture Models · Neural Networks and Applications