Technical Report: Co-learning of geometry and semantics for online 3D   mapping

Marcela Carvalho; Maxime Ferrera; Alexandre Boulch; Julien Moras,; Bertrand Le Saux; Pauline Trouv\'e-Peloux

arXiv:1911.01082·cs.CV·November 5, 2019·1 cites

Technical Report: Co-learning of geometry and semantics for online 3D mapping

Marcela Carvalho, Maxime Ferrera, Alexandre Boulch, Julien Moras,, Bertrand Le Saux, Pauline Trouv\'e-Peloux

PDF

Open Access

TL;DR

This paper presents a multi-task neural network for joint depth and semantic segmentation to improve 3D semantic reconstruction for autonomous navigation, demonstrating superior performance on the 3DRMS Challenge dataset.

Contribution

It introduces a deep multi-task neural network that refines depth and semantic segmentation simultaneously for enhanced 3D mapping.

Findings

01

Outperforms state-of-the-art methods on the 3DRMS dataset

02

Produces accurate semantic 3D point clouds and meshes

03

Effective co-learning improves reconstruction quality

Abstract

This paper is a technical report about our submission for the ECCV 2018 3DRMS Workshop Challenge on Semantic 3D Reconstruction \cite{Tylecek2018rms}. In this paper, we address 3D semantic reconstruction for autonomous navigation using co-learning of depth map and semantic segmentation. The core of our pipeline is a deep multi-task neural network which tightly refines depth and also produces accurate semantic segmentation maps. Its inputs are an image and a raw depth map produced from a pair of images by standard stereo vision. The resulting semantic 3D point clouds are then merged in order to create a consistent 3D mesh, in turn used to produce dense semantic 3D reconstruction maps. The performances of each step of the proposed method are evaluated on the dataset and multiple tasks of the 3DRMS Challenge, and repeatedly surpass state-of-the-art approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Human Pose and Action Recognition