OctNetFusion: Learning Depth Fusion from Data
Gernot Riegler, Ali Osman Ulusoy, Horst Bischof, Andreas Geiger

TL;DR
This paper introduces a deep learning approach for 3D depth fusion that improves reconstruction quality by handling occlusions, reducing noise, and filling gaps, surpassing traditional methods.
Contribution
It proposes a novel 3D CNN architecture that learns implicit surface representations from depth maps, enabling better reconstruction and shape completion.
Findings
Outperforms traditional TSDF and TV-L1 fusion methods.
Achieves state-of-the-art 3D shape completion results.
Effectively reconstructs occluded regions and fills gaps.
Abstract
In this paper, we present a learning based approach to depth fusion, i.e., dense 3D reconstruction from multiple depth images. The most common approach to depth fusion is based on averaging truncated signed distance functions, which was originally proposed by Curless and Levoy in 1996. While this method is simple and provides great results, it is not able to reconstruct (partially) occluded surfaces and requires a large number frames to filter out sensor noise and outliers. Motivated by the availability of large 3D model repositories and recent advances in deep learning, we present a novel 3D CNN architecture that learns to predict an implicit surface representation from the input depth maps. Our learning based method significantly outperforms the traditional volumetric fusion approach in terms of noise reduction and outlier suppression. By learning the structure of real world 3D…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Image and Object Detection Techniques
