Variational Lossy Autoencoder

Xi Chen; Diederik P. Kingma; Tim Salimans; Yan Duan; Prafulla; Dhariwal; John Schulman; Ilya Sutskever; Pieter Abbeel

arXiv:1611.02731·cs.LG·March 7, 2017·257 cites

Variational Lossy Autoencoder

Xi Chen, Diederik P. Kingma, Tim Salimans, Yan Duan, Prafulla, Dhariwal, John Schulman, Ilya Sutskever, Pieter Abbeel

PDF

Open Access

TL;DR

This paper introduces a variational autoencoder framework combined with autoregressive models to learn global, lossy representations of data, improving generative performance and enabling control over the information captured in the latent space.

Contribution

It proposes a novel VAE architecture that leverages autoregressive models for better control of learned representations and enhanced generative modeling.

Findings

01

Achieved state-of-the-art results on MNIST, OMNIGLOT, and Caltech-101 Silhouettes.

02

Demonstrated control over the type of information encoded in the latent space.

03

Improved generative modeling performance with autoregressive priors and decoders.

Abstract

Representation learning seeks to expose certain aspects of observed data in a learned representation that's amenable to downstream tasks like classification. For instance, a good representation for 2D images might be one that describes only global structure and discards information about detailed texture. In this paper, we present a simple but principled method to learn such global representations by combining Variational Autoencoder (VAE) with neural autoregressive models such as RNN, MADE and PixelRNN/CNN. Our proposed VAE model allows us to have control over what the global latent code can learn and , by designing the architecture accordingly, we can force the global latent code to discard irrelevant information such as texture in 2D images, and hence the VAE only "autoencodes" data in a lossy fashion. In addition, by leveraging autoregressive models as both prior distribution $p (z)$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition

MethodsSolana Customer Service Number +1-833-534-1729 · USD Coin Customer Service Number +1-833-534-1729