Inverse Graphics: Unsupervised Learning of 3D Shapes from Single Images
Talip Ucar

TL;DR
This paper introduces an unsupervised approach to learn 3D shapes from single images using generative models, demonstrating promising results on diverse datasets without supervision.
Contribution
It presents a novel end-to-end unsupervised method for 3D shape and pose recovery from single images, applicable to both synthetic and real datasets.
Findings
Successfully recovers 3D shapes from MNIST and Fashion datasets.
Adversarial training yields denser 3D shapes than variational auto-encoders.
Can recover complete 3D shape from a single image when objects are symmetric.
Abstract
Using generative models for Inverse Graphics is an active area of research. However, most works focus on developing models for supervised and semi-supervised methods. In this paper, we study the problem of unsupervised learning of 3D geometry from single images. Our approach is to use a generative model that produces 2-D images as projections of a latent 3D voxel grid, which we train either as a variational auto-encoder or using adversarial methods. Our contributions are as follows: First, we show how to recover 3D shape and pose from general datasets such as MNIST, and MNIST Fashion in good quality. Second, we compare the shapes learned using adversarial and variational methods. Adversarial approach gives denser 3D shapes. Third, we explore the idea of modelling the pose of an object as uniform distribution to recover 3D shape from a single image. Our experiment with the CelebA dataset…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Computer Graphics and Visualization Techniques
