Vox-Fusion: Dense Tracking and Mapping with Voxel-based Neural Implicit   Representation

Xingrui Yang; Hai Li; Hongjia Zhai; Yuhang Ming; Yuqian Liu; Guofeng; Zhang

arXiv:2210.15858·cs.CV·March 7, 2023

Vox-Fusion: Dense Tracking and Mapping with Voxel-based Neural Implicit Representation

Xingrui Yang, Hai Li, Hongjia Zhai, Yuhang Ming, Yuqian Liu, Guofeng, Zhang

PDF

1 Repo

TL;DR

Vox-Fusion introduces a voxel-based neural implicit mapping system that enables real-time dense tracking and mapping of arbitrary scenes, improving accuracy and supporting AR/VR applications.

Contribution

It extends implicit mapping with an octree structure and a multi-process framework for practical, real-time scene reconstruction.

Findings

01

Achieves better accuracy and completeness than previous methods.

02

Supports real-time performance for AR/VR applications.

03

Handles arbitrary scenes without prior environment knowledge.

Abstract

In this work, we present a dense tracking and mapping system named Vox-Fusion, which seamlessly fuses neural implicit representations with traditional volumetric fusion methods. Our approach is inspired by the recently developed implicit mapping and positioning system and further extends the idea so that it can be freely applied to practical scenarios. Specifically, we leverage a voxel-based neural implicit surface representation to encode and optimize the scene inside each voxel. Furthermore, we adopt an octree-based structure to divide the scene and support dynamic expansion, enabling our system to track and map arbitrary scenes without knowing the environment like in previous works. Moreover, we proposed a high-performance multi-process framework to speed up the method, thus supporting some applications that require real-time performance. The evaluation results show that our methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zju3dv/vox-fusion
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings