3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans

Ji Hou; Angela Dai; Matthias Nie{\ss}ner

arXiv:1812.07003·cs.CV·April 30, 2019·21 cites

3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans

Ji Hou, Angela Dai, Matthias Nie{\ss}ner

PDF

Open Access 1 Repo

TL;DR

3D-SIS is a neural network that combines geometric and color information from multi-view RGB-D scans to improve 3D semantic instance segmentation accuracy, outperforming existing methods on benchmark datasets.

Contribution

The paper introduces a novel multi-modal neural network architecture that fuses 2D image features with 3D volumetric data for enhanced 3D instance segmentation.

Findings

01

Achieves over 13 mAP improvement on real-world benchmarks.

02

Effectively fuses multi-view RGB-D data for better segmentation.

03

Outperforms state-of-the-art methods in accuracy.

Abstract

We introduce 3D-SIS, a novel neural network architecture for 3D semantic instance segmentation in commodity RGB-D scans. The core idea of our method is to jointly learn from both geometric and color signal, thus enabling accurate instance predictions. Rather than operate solely on 2D frames, we observe that most computer vision applications have multi-view RGB-D input available, which we leverage to construct an approach for 3D instance segmentation that effectively fuses together these multi-modal inputs. Our network leverages high-resolution RGB input by associating 2D images with the volumetric grid based on the pose alignment of the 3D reconstruction. For each image, we first extract 2D features for each pixel with a series of 2D convolutions; we then backproject the resulting feature vector to the associated voxel in the 3D grid. This combination of 2D and 3D feature learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Sekunde/3D-SIS
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Object Detection Techniques · Robotics and Sensor-Based Localization · 3D Surveying and Cultural Heritage