KL Divergence Between Gaussians: A Step-by-Step Derivation for the Variational Autoencoder Objective

Andr\'es Mu\~noz; Rodrigo Ramele

arXiv:2604.11744·cs.LG·April 14, 2026

KL Divergence Between Gaussians: A Step-by-Step Derivation for the Variational Autoencoder Objective

Andr\'es Mu\~noz, Rodrigo Ramele

PDF

TL;DR

This paper provides a detailed derivation of the closed-form KL divergence between Gaussian distributions, crucial for understanding and implementing the VAE objective.

Contribution

It offers a step-by-step derivation of the Gaussian KL divergence formula, clarifying its components and implications for VAE training.

Findings

01

Derived the univariate and multivariate Gaussian KL divergence formulas

02

Clarified the interpretation of each term in the divergence expression

03

Discussed the impact on VAE training dynamics

Abstract

Kullback-Leibler (KL) divergence is a fundamental concept in information theory that quantifies the discrepancy between two probability distributions. In the context of Variational Autoencoders (VAEs), it serves as a central regularization term, imposing structure on the latent space and thereby enabling the model to exhibit generative capabilities. In this work, we present a detailed derivation of the closed-form expression for the KL divergence between Gaussian distributions, a case of particular importance in practical VAE implementations. Starting from the general definition for continuous random variables, we derive the expression for the univariate case and extend it to the multivariate setting under the assumption of diagonal covariance. Finally, we discuss the interpretation of each term in the resulting expression and its impact on the training dynamics of the model.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.