PixelGAN Autoencoders

Alireza Makhzani; Brendan Frey

arXiv:1706.00531·cs.LG·June 5, 2017·25 cites

PixelGAN Autoencoders

Alireza Makhzani, Brendan Frey

PDF

Open Access 1 Repo

TL;DR

The PixelGAN autoencoder combines a PixelCNN generative model with a GAN-based recognition network to impose priors on latent codes, enabling unsupervised disentanglement and semi-supervised classification.

Contribution

It introduces a novel autoencoder architecture that integrates PixelCNN and GANs to control information decomposition and improve semi-supervised learning.

Findings

01

Different priors lead to various information decompositions.

02

Achieves unsupervised disentanglement of style and content.

03

Competitive semi-supervised classification results on benchmark datasets.

Abstract

In this paper, we describe the "PixelGAN autoencoder", a generative autoencoder in which the generative path is a convolutional autoregressive neural network on pixels (PixelCNN) that is conditioned on a latent code, and the recognition path uses a generative adversarial network (GAN) to impose a prior distribution on the latent code. We show that different priors result in different decompositions of information between the latent code and the autoregressive decoder. For example, by imposing a Gaussian distribution as the prior, we can achieve a global vs. local decomposition, or by imposing a categorical distribution as the prior, we can disentangle the style and content information of images in an unsupervised fashion. We further show how the PixelGAN autoencoder with a categorical prior can be directly used in semi-supervised settings and achieve competitive semi-supervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

anonyme20/nips20
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Music and Audio Processing

MethodsSolana Customer Service Number +1-833-534-1729