Unsupervised Learning of 3D Structure from Images

Danilo Jimenez Rezende; S. M. Ali Eslami; Shakir Mohamed and; Peter Battaglia; Max Jaderberg; Nicolas Heess

arXiv:1607.00662·cs.CV·June 20, 2018·98 cites

Unsupervised Learning of 3D Structure from Images

Danilo Jimenez Rezende, S. M. Ali Eslami, Shakir Mohamed and, Peter Battaglia, Max Jaderberg, Nicolas Heess

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel unsupervised deep learning approach to infer 3D structures from 2D images, achieving high-quality results and establishing new benchmarks in the field.

Contribution

It presents the first end-to-end trainable deep generative models for 3D structure inference from images without supervision.

Findings

01

High-quality 3D samples generated

02

Achieved competitive log-likelihoods on ShapeNet

03

First benchmarks for unsupervised 3D inference

Abstract

A key goal of computer vision is to recover the underlying 3D structure from 2D observations of the world. In this paper we learn strong deep generative models of 3D structures, and recover these structures from 3D and 2D images via probabilistic inference. We demonstrate high-quality samples and report log-likelihoods on several datasets, including ShapeNet [2], and establish the first benchmarks in the literature. We also show how these models and their inference networks can be trained end-to-end from 2D images. This demonstrates for the first time the feasibility of learning to infer 3D representations of the world in a purely unsupervised manner.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

fgolemo/threedee-tools
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Image Processing and 3D Reconstruction