Learning Neural Implicit through Volume Rendering with Attentive Depth   Fusion Priors

Pengchong Hu; Zhizhong Han

arXiv:2310.11598·cs.CV·January 9, 2024·2 cites

Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors

Pengchong Hu, Zhizhong Han

PDF

Open Access 1 Video

TL;DR

This paper introduces an attentive depth fusion prior for neural implicit representations, improving 3D reconstruction accuracy from multi-view RGBD images by effectively handling occlusions and holes in depth data.

Contribution

It proposes a novel attention mechanism that leverages fused TSDF depth priors, enhancing neural implicit learning from multi-view RGBD images, applicable to static scenes and SLAM.

Findings

01

Outperforms recent neural implicit methods on benchmark datasets

02

Effectively handles occlusions and incomplete depth data

03

Works with both scene-wide and incremental TSDF fusion

Abstract

Learning neural implicit representations has achieved remarkable performance in 3D reconstruction from multi-view images. Current methods use volume rendering to render implicit representations into either RGB or depth images that are supervised by multi-view ground truth. However, rendering a view each time suffers from incomplete depth at holes and unawareness of occluded structures from the depth supervision, which severely affects the accuracy of geometry inference via volume rendering. To resolve this issue, we propose to learn neural implicit representations from multi-view RGBD images through volume rendering with an attentive depth fusion prior. Our prior allows neural networks to perceive coarse 3D structures from the Truncated Signed Distance Function (TSDF) fused from all depth images available for rendering. The TSDF enables accessing the missing depth at holes on one depth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors· slideslive

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Surveying and Cultural Heritage · 3D Shape Modeling and Analysis