Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors
Pengchong Hu, Zhizhong Han

TL;DR
This paper introduces an attentive depth fusion prior for neural implicit representations, improving 3D reconstruction accuracy from multi-view RGBD images by effectively handling occlusions and holes in depth data.
Contribution
It proposes a novel attention mechanism that leverages fused TSDF depth priors, enhancing neural implicit learning from multi-view RGBD images, applicable to static scenes and SLAM.
Findings
Outperforms recent neural implicit methods on benchmark datasets
Effectively handles occlusions and incomplete depth data
Works with both scene-wide and incremental TSDF fusion
Abstract
Learning neural implicit representations has achieved remarkable performance in 3D reconstruction from multi-view images. Current methods use volume rendering to render implicit representations into either RGB or depth images that are supervised by multi-view ground truth. However, rendering a view each time suffers from incomplete depth at holes and unawareness of occluded structures from the depth supervision, which severely affects the accuracy of geometry inference via volume rendering. To resolve this issue, we propose to learn neural implicit representations from multi-view RGBD images through volume rendering with an attentive depth fusion prior. Our prior allows neural networks to perceive coarse 3D structures from the Truncated Signed Distance Function (TSDF) fused from all depth images available for rendering. The TSDF enables accessing the missing depth at holes on one depth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Surveying and Cultural Heritage · 3D Shape Modeling and Analysis
