HoloGAN: Unsupervised learning of 3D representations from natural images

Thu Nguyen-Phuoc; Chuan Li; Lucas Theis; Christian Richardt,; Yong-Liang Yang

arXiv:1904.01326·cs.CV·October 2, 2019·1 cites

HoloGAN: Unsupervised learning of 3D representations from natural images

Thu Nguyen-Phuoc, Chuan Li, Lucas Theis, Christian Richardt,, Yong-Liang Yang

PDF

Open Access 3 Repos

TL;DR

HoloGAN introduces an unsupervised 3D-aware GAN that learns explicit 3D representations from natural images, enabling pose control and disentanglement of shape and appearance without supervision.

Contribution

It is the first generative model to learn 3D representations from natural images in an entirely unsupervised manner, with explicit pose control.

Findings

01

Enables disentanglement of 3D pose and identity

02

Generates high-quality images with 3D understanding

03

Does not require pose labels or 3D data during training

Abstract

We propose a novel generative adversarial network (GAN) for the task of unsupervised learning of 3D representations from natural images. Most generative models rely on 2D kernels to generate images and make few assumptions about the 3D world. These models therefore tend to create blurry images or artefacts in tasks that require a strong 3D understanding, such as novel-view synthesis. HoloGAN instead learns a 3D representation of the world, and to render this representation in a realistic manner. Unlike other GANs, HoloGAN provides explicit control over the pose of generated objects through rigid-body transformations of the learnt 3D features. Our experiments show that using explicit 3D features enables HoloGAN to disentangle 3D pose and identity, which is further decomposed into shape and appearance, while still being able to generate images with similar or higher visual quality than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging