Disentangling factors of variation in deep representations using   adversarial training

Michael Mathieu; Junbo Zhao; Pablo Sprechmann; Aditya Ramesh; Yann; LeCun

arXiv:1611.03383·cs.LG·November 11, 2016·252 cites

Disentangling factors of variation in deep representations using adversarial training

Michael Mathieu, Junbo Zhao, Pablo Sprechmann, Aditya Ramesh, Yann, LeCun

PDF

Open Access 3 Repos

TL;DR

This paper presents a method combining autoencoders and adversarial training to disentangle specified and unspecified factors of variation in deep representations, enabling better understanding and manipulation of complex data.

Contribution

It introduces a novel approach for disentangling factors of variation without explicit labels for unspecified factors using adversarial training.

Findings

01

Effective disentanglement of factors in synthetic and real datasets

02

Generalizes to unseen classes and intra-class variability

03

Improves single-image analogy tasks

Abstract

We introduce a conditional generative model for learning to disentangle the hidden factors of variation within a set of labeled observations, and separate them into complementary codes. One code summarizes the specified factors of variation associated with the labels. The other summarizes the remaining unspecified variability. During training, the only available source of supervision comes from our ability to distinguish among different observations belonging to the same class. Examples of such observations include images of a set of labeled objects captured at different viewpoints, or recordings of set of speakers dictating multiple phrases. In both instances, the intra-class diversity is the source of the unspecified factors of variation: each object is observed at multiple viewpoints, and each speaker dictates multiple phrases. Learning to disentangle the specified factors from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Image Processing and 3D Reconstruction