There and Back Again: Unraveling the Variational Auto-Encoder
Graham Fyffe

TL;DR
This paper proves that variational auto-encoders can have constant posterior variances without loss of optimality, introduces a simplified ELBO formulation called BILBO, and proposes a scale-invariant likelihood method called BAGGINS, improving VAE performance.
Contribution
It demonstrates that VAEs can operate with fixed posterior variances, introduces BILBO for simplified ELBO computation, and proposes BAGGINS for scale-invariant likelihood modeling.
Findings
Constant posterior variances are sufficient under mild conditions.
BILBO simplifies ELBO as a batch expectation, applicable broadly.
BAGGINS improves stability and scale invariance in VAEs.
Abstract
We prove that the evidence lower bound (ELBO) employed by variational auto-encoders (VAEs) admits non-trivial solutions having constant posterior variances under certain mild conditions, removing the need to learn variances in the encoder. The proof follows from an unexpected journey through an array of topics: the closed form optimal decoder for Gaussian VAEs, a proof that the decoder is always smooth, a proof that the ELBO at its stationary points is equal to the exact log evidence, and the posterior variance is merely part of a stochastic estimator of the decoder Hessian. The penalty incurred from using a constant posterior variance is small under mild conditions, and otherwise discourages large variations in the decoder Hessian. From here we derive a simplified formulation of the ELBO as an expectation over a batch, which we call the Batch Information Lower Bound (BILBO). Despite…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference · Adversarial Robustness in Machine Learning
