Variational autoencoders in the presence of low-dimensional data:   landscape and implicit bias

Frederic Koehler; Viraj Mehta; Chenghui Zhou; Andrej Risteski

arXiv:2112.06868·cs.LG·May 19, 2022

Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias

Frederic Koehler, Viraj Mehta, Chenghui Zhou, Andrej Risteski

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper investigates how variational autoencoders (VAEs) learn data supported on low-dimensional manifolds, revealing that linear VAEs recover the true manifold due to implicit bias, while nonlinear VAEs often learn larger, higher-dimensional manifolds.

Contribution

It provides a theoretical analysis of VAE training dynamics, demonstrating the role of implicit bias in recovering the true data manifold in linear cases.

Findings

01

Linear VAEs recover the true data manifold due to implicit bias.

02

Nonlinear VAEs tend to learn a higher-dimensional manifold.

03

Training dynamics explain the success of VAEs in low-dimensional data.

Abstract

Variational Autoencoders are one of the most commonly used generative models, particularly for image data. A prominent difficulty in training VAEs is data that is supported on a lower-dimensional manifold. Recent work by Dai and Wipf (2020) proposes a two-stage training algorithm for VAEs, based on a conjecture that in standard VAE training the generator will converge to a solution with 0 variance which is correctly supported on the ground truth manifold. They gave partial support for that conjecture by showing that some optima of the VAE loss do satisfy this property, but did not analyze the training dynamics. In this paper, we show that for linear encoders/decoders, the conjecture is true-that is the VAE training does recover a generator with support equal to the ground truth manifold-and does so due to an implicit bias of gradient descent rather than merely the VAE loss itself. In…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

virajmehta/vae-training
jaxOfficial

Videos

Variational autoencoders in the presence of low-dimensional data: landscape and implicit bias· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks