Understanding disentangling in $\beta$-VAE

Christopher P. Burgess; Irina Higgins; Arka Pal; Loic Matthey; Nick; Watters; Guillaume Desjardins; Alexander Lerchner

arXiv:1804.03599·stat.ML·April 11, 2018·279 cites

Understanding disentangling in $\beta$-VAE

Christopher P. Burgess, Irina Higgins, Arka Pal, Loic Matthey, Nick, Watters, Guillaume Desjardins, Alexander Lerchner

PDF

Open Access 5 Repos 1 Datasets

TL;DR

This paper offers new theoretical insights into how disentangled representations emerge in $eta$-VAE, proposing a training modification that improves disentanglement without sacrificing reconstruction quality.

Contribution

It introduces a rate-distortion perspective to understand disentanglement and proposes a progressive capacity increase method for better $eta$-VAE training.

Findings

01

Disentanglement emerges under specific rate-distortion conditions.

02

Progressive capacity increase improves disentanglement and reconstruction.

03

Theoretical assessment clarifies conditions for disentangled representation emergence.

Abstract

We present new intuitions and theoretical assessments of the emergence of disentangled representation in variational autoencoders. Taking a rate-distortion theory perspective, we show the circumstances under which representations aligned with the underlying generative factors of variation of data emerge when optimising the modified ELBO bound in $β$ -VAE, as training progresses. From these insights, we propose a modification to the training regime of $β$ -VAE, that progressively increases the information capacity of the latent code during training. This modification facilitates the robust learning of disentangled representations in $β$ -VAE, without the previous trade-off in reconstruction accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Datasets

agrawalchaitany/cyberbert_dataset
dataset· 7 dl
7 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques · Digital Media Forensic Detection