VTGaussian-SLAM: RGBD SLAM for Large Scale Scenes with Splatting View-Tied 3D Gaussians
Pengchong Hu, Zhizhong Han

TL;DR
This paper introduces VTGaussian-SLAM, a scalable RGBD SLAM system using view-tied 3D Gaussians that improves efficiency, rendering quality, and tracking accuracy for large-scale scenes.
Contribution
The paper proposes view-tied 3D Gaussians and novel strategies that enable scalable, high-quality RGBD SLAM without extensive GPU memory usage or learning all Gaussian parameters.
Findings
Outperforms state-of-the-art methods on benchmark datasets.
Achieves higher rendering and tracking accuracy.
Demonstrates improved scalability for large scenes.
Abstract
Jointly estimating camera poses and mapping scenes from RGBD images is a fundamental task in simultaneous localization and mapping (SLAM). State-of-the-art methods employ 3D Gaussians to represent a scene, and render these Gaussians through splatting for higher efficiency and better rendering. However, these methods cannot scale up to extremely large scenes, due to the inefficient tracking and mapping strategies that need to optimize all 3D Gaussians in the limited GPU memories throughout the training to maintain the geometry and color consistency to previous RGBD observations. To resolve this issue, we propose novel tracking and mapping strategies to work with a novel 3D representation, dubbed view-tied 3D Gaussians, for RGBD SLAM systems. View-tied 3D Gaussians is a kind of simplified Gaussians, which is tied to depth pixels, without needing to learn locations, rotations, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsRobotics and Sensor-Based Localization · Remote Sensing and LiDAR Applications · 3D Surveying and Cultural Heritage
