Generative Models with ELBOs Converging to Entropy Sums
Jan Warnken, Dmytro Velychko, Simon Damm, Asja Fischer, J\"org, L\"ucke

TL;DR
This paper proves that the ELBOs of various generative models converge to sums of entropies, providing a unified theoretical understanding of their behavior at stationary points.
Contribution
It establishes convergence of ELBOs to entropy sums for multiple generative models, including complex classes like exponential family mixtures, with rigorous proofs.
Findings
ELBOs of several models equal entropy sums at stationary points
Convergence holds under realistic conditions such as finite data and model mismatches
Includes proofs for models like probabilistic PCA, sigmoid belief nets, and Gaussian mixtures
Abstract
The evidence lower bound (ELBO) is one of the most central objectives for probabilistic unsupervised learning. For the ELBOs of several generative models and model classes, we here prove convergence to entropy sums. As one result, we provide a list of generative models for which entropy convergence has been shown, so far, along with the corresponding expressions for entropy sums. Our considerations include very prominent generative models such as probabilistic PCA, sigmoid belief nets or Gaussian mixture models. However, we treat more models and entire model classes such as general mixtures of exponential family distributions. Our main contributions are the proofs for the individual models. For each given model we show that the conditions stated in Theorem 1 or Theorem 2 of [arXiv:2209.03077] are fulfilled such that by virtue of the theorems the given model's ELBO is equal to an entropy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEconomic theories and models · Stochastic processes and financial applications
MethodsPrincipal Components Analysis
