On the Convergence of the ELBO to Entropy Sums
J\"org L\"ucke, Jan Warnken

TL;DR
This paper proves that for a broad class of generative models, the variational lower bound at stationary points equals a sum of three entropies, providing a new theoretical insight into the nature of ELBO convergence.
Contribution
It establishes that the ELBO at stationary points equals a sum of entropies for many models, including variational autoencoders, under mild conditions.
Findings
ELBO equals a sum of entropies at stationary points.
The result applies to models with exponential family distributions.
The proof covers finite data points and saddle points.
Abstract
The variational lower bound (a.k.a. ELBO or free energy) is the central objective for many established as well as for many novel algorithms for unsupervised learning. Such algorithms usually increase the bound until parameters have converged to values close to a stationary point of the learning dynamics. Here we show that (for a very large class of generative models) the variational lower bound is at all stationary points of learning equal to a sum of entropies. Concretely, for standard generative models with one set of latents and one set of observed variables, the sum consists of three entropies: (A) the (average) entropy of the variational distributions, (B) the negative entropy of the model's prior distribution, and (C) the (expected) negative entropy of the observable distribution. The obtained result applies under realistic conditions including: finite numbers of data points, at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Neural Networks and Applications · Machine Learning in Materials Science
MethodsPrincipal Components Analysis · Balanced Selection
