Multi-view 3D Models from Single Images with a Convolutional Network

Maxim Tatarchenko; Alexey Dosovitskiy; Thomas Brox

arXiv:1511.06702 · cs.CV · August 3, 2016 · 1 citation

TL;DR

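This paper introduces a convolutional network that infers a 3D representation of an object from a single image. The network predicts RGB views and depth maps from arbitrary viewpoints, and these are fused to reconstruct a full 3D model. It is trained on synthetic renderings and also gives reasonable predictions for real images of cars.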

Contribution

It is the first to generate complete 3D models from a single image using a convolutional network trained on synthetic data.

Findings

1. Successfully predicts 3D structures from single images.
2. Handles cluttered backgrounds and real images.
3. Generates accurate point clouds and surface meshes.

Abstract

We present a convolutional network capable of inferring a 3D representation of a previously unseen object given a single image of this object. Concretely, the network can predict an RGB image and a depth map of the object as seen from an arbitrary view. Several of these depth maps fused together give a full point cloud of the object. The point cloud can in turn be transformed into a surface mesh. The network is trained on renderings of synthetic 3D models of cars and chairs. It successfully deals with objects on cluttered background and generates reasonable predictions for real images of cars.
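
The fusion step described in the abstract (several predicted depth maps combined into one point cloud) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the pinhole camera with focal length `focal` and principal point `(cx, cy)`, the camera orbiting the object at a fixed `distance` while looking at the origin, the use of azimuth alone to describe a view, and the function names `backproject`, `camera_to_world`, and `fuse_views` are all assumptions made for this example. Pixels with depth 0 are treated as background.

```python
import math

def backproject(depth, focal, cx, cy):
    """Convert one depth map (list of rows) into 3D points in camera coordinates.

    Assumes a pinhole camera; pixels with depth <= 0 are treated as background.
    """
    points = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0.0:
                continue  # no object at this pixel
            points.append(((u - cx) * z / focal, (v - cy) * z / focal, z))
    return points

def camera_to_world(points, azimuth_deg, distance):
    """Move camera-space points into a shared object-centered frame.

    Assumed setup: at azimuth 0 the camera sits at (0, 0, -distance) and looks
    along +z at the origin; other views rotate about the vertical (y) axis.
    """
    a = math.radians(azimuth_deg)
    cos_a, sin_a = math.cos(a), math.sin(a)
    world = []
    for x, y, z in points:
        zc = z - distance  # shift so the object center is the origin
        world.append((cos_a * x + sin_a * zc, y, -sin_a * x + cos_a * zc))
    return world

def fuse_views(views, focal, cx, cy, distance):
    """Fuse (depth_map, azimuth_deg) pairs into a single point cloud."""
    cloud = []
    for depth, azimuth_deg in views:
        camera_points = backproject(depth, focal, cx, cy)
        cloud.extend(camera_to_world(camera_points, azimuth_deg, distance))
    return cloud

# Two toy 3x3 depth maps of the same object seen from the front and the back.
toy_depth = [[0.0, 0.0, 0.0],
             [0.0, 2.0, 0.0],
             [0.0, 0.0, 0.0]]
toy_cloud = fuse_views([(toy_depth, 0.0), (toy_depth, 180.0)],
                       focal=1.0, cx=1.0, cy=1.0, distance=3.0)
```

In the toy example, each view contributes one surface point, and the two points land on opposite sides of the origin along the z axis, which is how views from different directions fill in the full shape. A surface mesh would then be obtained by running a separate surface reconstruction method on the fused cloud.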

Taxonomy

Topics: Computer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis