RayDF: Neural Ray-surface Distance Fields with Multi-view Consistency
Zhuoman Liu, Bo Yang, Yan Luximon, Ajay Kumar, Jinxi Li

TL;DR
RayDF introduces a novel neural framework for 3D shape representation that ensures multi-view consistency, enabling faster rendering and improved surface reconstruction compared to existing methods.
Contribution
The paper presents RayDF, a new ray-based neural shape representation with multi-view consistency, dual-ray visibility classifier, and significant speed and accuracy improvements.
Findings
Achieves superior 3D surface point reconstruction on multiple datasets.
Surpasses existing coordinate-based and ray-based methods in performance.
Enables 1000x faster rendering of depth images.
Abstract
In this paper, we study the problem of continuous 3D shape representations. The majority of existing successful methods are coordinate-based implicit neural representations. However, they are inefficient to render novel views or recover explicit surface points. A few works start to formulate 3D shapes as ray-based neural functions, but the learned structures are inferior due to the lack of multi-view geometry consistency. To tackle these challenges, we propose a new framework called RayDF. It consists of three major components: 1) the simple ray-surface distance field, 2) the novel dual-ray visibility classifier, and 3) a multi-view consistency optimization module to drive the learned ray-surface distances to be multi-view geometry consistent. We extensively evaluate our method on three public datasets, demonstrating remarkable performance in 3D surface point reconstruction on both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
