Langevin Autoencoders for Learning Deep Latent Variable Models

Shohei Taniguchi; Yusuke Iwasawa; Wataru Kumagai; Yutaka Matsuo

arXiv:2209.07036·cs.LG·October 12, 2022

Langevin Autoencoders for Learning Deep Latent Variable Models

Shohei Taniguchi, Yusuke Iwasawa, Wataru Kumagai, Yutaka Matsuo

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Langevin autoencoders and amortized Langevin dynamics, enabling efficient deep latent variable modeling by replacing costly MCMC sampling with encoder updates, and proves their validity as MCMC algorithms.

Contribution

The paper proposes ALD to replace datapoint-wise MCMC with encoder updates, and introduces the Langevin autoencoder, a novel deep latent variable model based on this method.

Findings

01

ALD accurately samples from target posteriors in synthetic datasets.

02

LAE outperforms variational autoencoders in image generation tasks.

03

LAE surpasses existing MCMC-based methods in test likelihood.

Abstract

Markov chain Monte Carlo (MCMC), such as Langevin dynamics, is valid for approximating intractable distributions. However, its usage is limited in the context of deep latent variable models owing to costly datapoint-wise sampling iterations and slow convergence. This paper proposes the amortized Langevin dynamics (ALD), wherein datapoint-wise MCMC iterations are entirely replaced with updates of an encoder that maps observations into latent variables. This amortization enables efficient posterior sampling without datapoint-wise iterations. Despite its efficiency, we prove that ALD is valid as an MCMC algorithm, whose Markov chain has the target posterior as a stationary distribution under mild assumptions. Based on the ALD, we also present a new deep latent variable model named the Langevin autoencoder (LAE). Interestingly, the LAE can be implemented by slightly modifying the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ishohei220/lae
pytorchOfficial

Videos

Langevin Autoencoders for Learning Deep Latent Variable Models· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Machine Learning in Healthcare

MethodsTest