Tutorial: Deriving the Standard Variational Autoencoder (VAE) Loss   Function

Stephen Odaibo

arXiv:1907.08956·cs.LG·July 23, 2019·47 cites

Tutorial: Deriving the Standard Variational Autoencoder (VAE) Loss Function

Stephen Odaibo

PDF

Open Access

TL;DR

This tutorial provides a detailed derivation of the standard VAE loss function, including the variational lower bound, KL divergence, and assumptions like Gaussian priors, clarifying the mathematical foundations of VAEs.

Contribution

It offers a comprehensive, step-by-step derivation of the VAE loss function, making the mathematical underpinnings explicit and accessible.

Findings

01

Closed-form solution for the KL divergence under Gaussian assumptions

02

Clear derivation from Bayes' theorem to the VAE loss function

03

Enhanced understanding of the variational lower bound in VAEs

Abstract

In Bayesian machine learning, the posterior distribution is typically computationally intractable, hence variational inference is often required. In this approach, an evidence lower bound on the log likelihood of data is maximized during training. Variational Autoencoders (VAE) are one important example where variational inference is utilized. In this tutorial, we derive the variational lower bound loss function of the standard variational autoencoder. We do so in the instance of a gaussian latent prior and gaussian approximate posterior, under which assumptions the Kullback-Leibler term in the variational lower bound has a closed form solution. We derive essentially everything we use along the way; everything from Bayes' theorem to the Kullback-Leibler divergence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis