Semantic Dense Reconstruction with Consistent Scene Segments
Yingcai Wan, Yanyan Li, Yingxuan You, Cheng Guo, Lijin Fang and, Federico Tombari

TL;DR
This paper introduces a method for dense semantic 3D scene reconstruction from RGB-D sequences, combining consistent 2D segmentation, 3D modeling, and a novel semantic projection block to improve scene understanding.
Contribution
It proposes a new framework integrating 2D semantic segmentation, 3D modeling, and a semantic projection block for enhanced dense reconstruction and semantic prediction.
Findings
Achieves accurate 3D dense reconstruction.
Attains state-of-the-art semantic prediction performance.
Demonstrates robustness on public datasets.
Abstract
In this paper, a method for dense semantic 3D scene reconstruction from an RGB-D sequence is proposed to solve high-level scene understanding tasks. First, each RGB-D pair is consistently segmented into 2D semantic maps based on a camera tracking backbone that propagates objects' labels with high probabilities from full scans to corresponding ones of partial views. Then a dense 3D mesh model of an unknown environment is incrementally generated from the input RGB-D sequence. Benefiting from 2D consistent semantic segments and the 3D model, a novel semantic projection block (SP-Block) is proposed to extract deep feature volumes from 2D segments of different views. Moreover, the semantic volumes are fused into deep volumes from a point cloud encoder to make the final semantic segmentation. Extensive experimental evaluations on public datasets show that our system achieves accurate 3D dense…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · 3D Shape Modeling and Analysis
