Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias
Frederic Koehler, Viraj Mehta, Chenghui Zhou, Andrej Risteski

TL;DR
This paper investigates how variational autoencoders (VAEs) learn data supported on low-dimensional manifolds, revealing that linear VAEs recover the true manifold due to implicit bias, while nonlinear VAEs often learn larger, higher-dimensional manifolds.
Contribution
It provides a theoretical analysis of VAE training dynamics, demonstrating the role of implicit bias in recovering the true data manifold in linear cases.
Findings
Linear VAEs recover the true data manifold due to implicit bias.
Nonlinear VAEs tend to learn a higher-dimensional manifold.
Training dynamics explain the success of VAEs in low-dimensional data.
Abstract
Variational Autoencoders are one of the most commonly used generative models, particularly for image data. A prominent difficulty in training VAEs is data that is supported on a lower-dimensional manifold. Recent work by Dai and Wipf (2020) proposes a two-stage training algorithm for VAEs, based on a conjecture that in standard VAE training the generator will converge to a solution with 0 variance which is correctly supported on the ground truth manifold. They gave partial support for that conjecture by showing that some optima of the VAE loss do satisfy this property, but did not analyze the training dynamics. In this paper, we show that for linear encoders/decoders, the conjecture is true-that is the VAE training does recover a generator with support equal to the ground truth manifold-and does so due to an implicit bias of gradient descent rather than merely the VAE loss itself. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks
