Two Stream 3D Semantic Scene Completion

Martin Garbade; Yueh-Tung Chen; Johann Sawatzky; Juergen Gall

arXiv:1804.03550·cs.CV·May 16, 2019

Two Stream 3D Semantic Scene Completion

Martin Garbade, Yueh-Tung Chen, Johann Sawatzky, Juergen Gall

PDF

TL;DR

This paper introduces a two-stream 3D semantic scene completion method that combines depth and RGB-based semantic information, significantly improving over existing approaches in inferring complete 3D scenes from partial data.

Contribution

It proposes a novel two-stream approach using depth and RGB semantic cues with a 3D CNN, advancing the accuracy of semantic scene completion.

Findings

01

Outperforms state-of-the-art methods in semantic scene completion

02

Uses a compact three-channel semantic encoding

03

Employs a 3D CNN for inference

Abstract

Inferring the 3D geometry and the semantic meaning of surfaces, which are occluded, is a very challenging task. Recently, a first end-to-end learning approach has been proposed that completes a scene from a single depth image. The approach voxelizes the scene and predicts for each voxel if it is occupied and, if it is occupied, the semantic class label. In this work, we propose a two stream approach that leverages depth information and semantic information, which is inferred from the RGB image, for this task. The approach constructs an incomplete 3D semantic tensor, which uses a compact three-channel encoding for the inferred semantic information, and uses a 3D CNN to infer the complete 3D semantic tensor. In our experimental evaluation, we show that the proposed two stream approach substantially outperforms the state-of-the-art for semantic scene completion.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.