Unsupervised Generative 3D Shape Learning from Natural Images
Attila Szab\'o, Givi Meishvili, Paolo Favaro

TL;DR
This paper introduces a fully unsupervised method for learning 3D shape generative models directly from natural images, without relying on 3D annotations, stereo videos, or ego-motion data.
Contribution
It presents a novel two-stage GAN-based approach that learns 3D representations from 2D images, disentangling shape from viewpoint, using a differentiable renderer.
Findings
Successfully learned 3D face shapes from natural images
Achieved realistic 3D shape generation without 3D supervision
Demonstrated disentangled 3D representation learning
Abstract
In this paper we present, to the best of our knowledge, the first method to learn a generative model of 3D shapes from natural images in a fully unsupervised way. For example, we do not use any ground truth 3D or 2D annotations, stereo video, and ego-motion during the training. Our approach follows the general strategy of Generative Adversarial Networks, where an image generator network learns to create image samples that are realistic enough to fool a discriminator network into believing that they are natural images. In contrast, in our approach the image generation is split into 2 stages. In the first stage a generator network outputs 3D objects. In the second, a differentiable renderer produces an image of the 3D objects from random viewpoints. The key observation is that a realistic 3D object should yield a realistic rendering from any plausible viewpoint. Thus, by randomizing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Image Retrieval and Classification Techniques · Advanced Vision and Imaging
