TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization

Zhen Tan; Zongtan Zhou; Yangbing Ge; Zi Wang; Xieyuanli Chen; Dewen Hu

arXiv:2405.07027 · cs.CV · October 8, 2024

TL;DR

TD-NeRF jointly optimizes camera poses and a neural radiance field from monocular images. It exploits monocular depth priors through truncated-normal ray sampling, coarse-to-fine training, and robust constraints to improve accuracy and convergence.

Contribution

It proposes a new approach that explicitly uses monocular depth priors for joint camera pose and NeRF optimization, while accounting for noise in those priors and avoiding local minima.

Findings

01 Outperforms prior methods in pose and depth accuracy

02 Faster convergence in training process

03 Produces more accurate 3D reconstructions

Abstract

The reliance on accurate camera poses is a significant barrier to the widespread deployment of Neural Radiance Fields (NeRF) models for 3D reconstruction and SLAM tasks. The existing method introduces monocular depth priors to jointly optimize the camera poses and NeRF, but it fails to fully exploit the depth priors and neglects the impact of their inherent noise. In this paper, we propose Truncated Depth NeRF (TD-NeRF), a novel approach that enables training NeRF from unknown camera poses by jointly optimizing learnable parameters of the radiance field and camera poses. Our approach explicitly utilizes monocular depth priors through three key advancements: 1) we propose a novel depth-based ray sampling strategy based on the truncated normal distribution, which improves the convergence speed and accuracy of pose estimation; 2) to circumvent local minima and refine depth geometry, we…
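The depth-based ray sampling idea in point 1) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name, the `sigma` spread, and the `near`/`far` bounds are all assumptions chosen for illustration. It draws sample depths along one ray from a normal distribution centred on the monocular depth prior and truncated to the ray's valid range, using inverse-CDF sampling from the Python standard library.

```python
import random
from statistics import NormalDist


def sample_truncated_normal_depths(depth_prior, sigma, near, far, n_samples, rng=None):
    """Return n_samples sorted depths in [near, far].

    Depths follow a normal distribution centred on depth_prior with standard
    deviation sigma, truncated to [near, far]. All parameter names are
    hypothetical; the paper's actual parameterisation may differ.
    """
    if not near < far:
        raise ValueError("near must be smaller than far")
    if sigma <= 0:
        raise ValueError("sigma must be positive")
    rng = rng or random.Random()
    dist = NormalDist(mu=depth_prior, sigma=sigma)

    # Probability mass of the untruncated normal below each bound.
    cdf_near = dist.cdf(near)
    cdf_far = dist.cdf(far)

    depths = []
    for _ in range(n_samples):
        # Uniform draw restricted to the mass between the bounds, then
        # mapped back through the inverse CDF (inverse transform sampling).
        u = rng.uniform(cdf_near, cdf_far)
        u = min(max(u, 1e-12), 1.0 - 1e-12)  # inv_cdf requires 0 < u < 1
        t = dist.inv_cdf(u)
        depths.append(min(max(t, near), far))  # guard against rounding error
    return sorted(depths)


# Example: 64 samples concentrated around a prior depth of 3.0 on a ray
# whose valid range is [0.5, 6.0].
ray_depths = sample_truncated_normal_depths(3.0, 0.2, 0.5, 6.0, 64, random.Random(0))
```

Compared with uniform sampling over the whole ray, this concentrates samples near the surface suggested by the depth prior, while the truncation keeps every sample inside the ray bounds. A larger `sigma` would be one way to express lower confidence in a noisy prior.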

Code & Models

Repositories

nubot-nudt/td-nerf (Official)

Taxonomy

Topics: Advanced Vision and Imaging · Robotics and Sensor-Based Localization · Optical measurement and interference techniques

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings