Ternary-Type Opacity and Hybrid Odometry for RGB NeRF-SLAM

Junru Lin; Asen Nachkov; Songyou Peng; Luc Van Gool; Danda Pani Paudel

arXiv:2312.13332·cs.CV·September 24, 2024·1 cites

Ternary-Type Opacity and Hybrid Odometry for RGB NeRF-SLAM

Junru Lin, Asen Nachkov, Songyou Peng, Luc Van Gool, Danda Pani Paudel

PDF

Open Access

TL;DR

This paper introduces a novel ternary opacity model and hybrid odometry scheme to enhance RGB-only NeRF-SLAM, achieving state-of-the-art accuracy and speed in complex environment navigation without depth data.

Contribution

It proposes a ternary opacity model and a hybrid odometry method, significantly improving NeRF-SLAM performance using only RGB inputs.

Findings

01

Achieved state-of-the-art accuracy on synthetic and real datasets.

02

Enhanced depth rendering through the ternary opacity model.

03

Improved speed and robustness in SLAM tasks.

Abstract

In this work, we address the challenge of deploying Neural Radiance Field (NeRFs) in Simultaneous Localization and Mapping (SLAM) under the condition of lacking depth information, relying solely on RGB inputs. The key to unlocking the full potential of NeRF in such a challenging context lies in the integration of real-world priors. A crucial prior we explore is the binary opacity prior of 3D space with opaque objects. To effectively incorporate this prior into the NeRF framework, we introduce a ternary-type opacity (TT) model instead, which categorizes points on a ray intersecting a surface into three regions: before, on, and behind the surface. This enables a more accurate rendering of depth, subsequently improving the performance of image warping techniques. Therefore, we further propose a novel hybrid odometry (HO) scheme that merges bundle adjustment and warping-based localization.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · Robotics and Sensor-Based Localization

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings