Layer-structured 3D Scene Inference via View Synthesis

Shubham Tulsiani; Richard Tucker; Noah Snavely

arXiv:1807.10264·cs.CV·July 27, 2018·5 cites

Layer-structured 3D Scene Inference via View Synthesis

Shubham Tulsiani, Richard Tucker, Noah Snavely

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to infer layered 3D scene representations from a single image by using view synthesis as supervision, enabling the capture of hidden scene content without direct labels.

Contribution

It proposes a novel, differentiable view renderer and a learning framework that leverages multi-view signals to infer comprehensive 3D scene layers from a single image.

Findings

01

Successfully infers depth and texture for hidden scene content.

02

Achieves accurate scene reconstructions in multiple settings.

03

Demonstrates the effectiveness of view synthesis as supervision.

Abstract

We present an approach to infer a layer-structured 3D representation of a scene from a single input image. This allows us to infer not only the depth of the visible pixels, but also to capture the texture and depth for content in the scene that is not directly visible. We overcome the challenge posed by the lack of direct supervision by instead leveraging a more naturally available multi-view supervisory signal. Our insight is to use view synthesis as a proxy task: we enforce that our representation (inferred from a single image), when rendered from a novel perspective, matches the true observed image. We present a learning framework that operationalizes this insight using a new, differentiable novel view renderer. We provide qualitative and quantitative validation of our approach in two different settings, and demonstrate that we can learn to capture the hidden aspects of a scene.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

VCL3D/SphericalViewSynthesis
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Robotics and Sensor-Based Localization