Efficient Gradient-Based Inference through Transformations between Bayes   Nets and Neural Nets

Diederik P. Kingma; Max Welling

arXiv:1402.0480·cs.LG·January 23, 2015·39 cites

Efficient Gradient-Based Inference through Transformations between Bayes Nets and Neural Nets

Diederik P. Kingma, Max Welling

PDF

Open Access

TL;DR

This paper demonstrates how hierarchical Bayesian networks and neural networks can be transformed into each other through parameterization choices, improving the efficiency and robustness of gradient-based inference.

Contribution

It introduces a method to switch between centered and non-centered parameterizations, enhancing inference efficiency and robustness in Bayesian and neural network models.

Findings

01

Transformations enable switching between Bayesian and neural network models.

02

Non-centered parameterization allows simple Monte Carlo estimation of marginal likelihood.

03

Theoretical insights are validated through experiments.

Abstract

Hierarchical Bayesian networks and neural networks with stochastic hidden units are commonly perceived as two separate types of models. We show that either of these types of models can often be transformed into an instance of the other, by switching between centered and differentiable non-centered parameterizations of the latent variables. The choice of parameterization greatly influences the efficiency of gradient-based posterior inference; we show that they are often complementary to eachother, we clarify when each parameterization is preferred and show how inference can be made robust. In the non-centered form, a simple Monte Carlo estimator of the marginal likelihood can be used for learning the parameters. Theoretical results are supported by experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Neural Networks and Applications · Bayesian Modeling and Causal Inference