Learned Semantic Multi-Sensor Depth Map Fusion
Denys Rozumnyi, Ian Cherabier, Marc Pollefeys, Martin R. Oswald

TL;DR
This paper introduces a neural network framework that integrates semantics, multi-sensor data, denoising, and scene completion into a unified 3D reconstruction method, improving over traditional volumetric fusion techniques.
Contribution
It is the first approach to unify semantic information, multi-sensor fusion, denoising, and scene completion within a single neural network-based framework.
Findings
Clear improvements demonstrated on synthetic data
Effective scene denoising and hole filling
Unification of multiple properties in 3D reconstruction
Abstract
Volumetric depth map fusion based on truncated signed distance functions has become a standard method and is used in many 3D reconstruction pipelines. In this paper, we are generalizing this classic method in multiple ways: 1) Semantics: Semantic information enriches the scene representation and is incorporated into the fusion process. 2) Multi-Sensor: Depth information can originate from different sensors or algorithms with very different noise and outlier statistics which are considered during data fusion. 3) Scene denoising and completion: Sensors can fail to recover depth for certain materials and light conditions, or data is missing due to occlusions. Our method denoises the geometry, closes holes and computes a watertight surface for every semantic class. 4) Learning: We propose a neural network reconstruction method that unifies all these properties within a single powerful…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
