Disentangling factors of variation in deep representations using adversarial training
Michael Mathieu, Junbo Zhao, Pablo Sprechmann, Aditya Ramesh, Yann, LeCun

TL;DR
This paper presents a method combining autoencoders and adversarial training to disentangle specified and unspecified factors of variation in deep representations, enabling better understanding and manipulation of complex data.
Contribution
It introduces a novel approach for disentangling factors of variation without explicit labels for unspecified factors using adversarial training.
Findings
Effective disentanglement of factors in synthetic and real datasets
Generalizes to unseen classes and intra-class variability
Improves single-image analogy tasks
Abstract
We introduce a conditional generative model for learning to disentangle the hidden factors of variation within a set of labeled observations, and separate them into complementary codes. One code summarizes the specified factors of variation associated with the labels. The other summarizes the remaining unspecified variability. During training, the only available source of supervision comes from our ability to distinguish among different observations belonging to the same class. Examples of such observations include images of a set of labeled objects captured at different viewpoints, or recordings of set of speakers dictating multiple phrases. In both instances, the intra-class diversity is the source of the unspecified factors of variation: each object is observed at multiple viewpoints, and each speaker dictates multiple phrases. Learning to disentangle the specified factors from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Image Processing and 3D Reconstruction
