From Points to Spheres: A Geometric Reinterpretation of Variational Autoencoders
Songxuan Shi

TL;DR
This paper offers a new geometric perspective on Variational Autoencoders, viewing latent spaces as Gaussian balls constrained by KL divergence, which enhances understanding of their generative capabilities.
Contribution
It introduces a geometric reinterpretation of VAEs, connecting them with VQ-VAE and emphasizing the role of latent space geometry in generative modeling.
Findings
Latent representations form Gaussian balls influenced by KL divergence.
Reparameterization acts as a contractual mechanism between encoder and decoder.
VQ-VAE is viewed as a constrained autoencoder with cluster centers.
Abstract
Variational Autoencoder is typically understood from the perspective of probabilistic inference. In this work, we propose a new geometric reinterpretation which complements the probabilistic view and enhances its intuitiveness. We demonstrate that the proper construction of semantic manifolds arises primarily from the constraining effect of the KL divergence on the encoder. We view the latent representations as a Gaussian ball rather than deterministic points. Under the constraint of KL divergence, Gaussian ball regularizes the latent space, promoting a more uniform distribution of encodings. Furthermore, we show that reparameterization establishes a critical contractual mechanism between the encoder and decoder, enabling the decoder to learn how to reconstruct from these stochastic regions. We further connect this viewpoint with VQ-VAE, offering a unified perspective: VQ-VAE can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Gaussian Processes and Bayesian Inference
