UP-SLAM: Adaptively Structured Gaussian SLAM with Uncertainty Prediction in Dynamic Environments
Wancai Zheng, Linlin Ou, Jiajie He, Libo Zhou, Xinyi Yu, Yan Wei

TL;DR
UP-SLAM introduces a real-time RGB-D SLAM system that effectively handles dynamic environments by decoupling tracking and mapping, employing adaptive probabilistic structures, and estimating motion uncertainty without semantic labels, leading to improved accuracy and rendering quality.
Contribution
The paper presents UP-SLAM, a novel SLAM system that combines adaptive Gaussian primitives, a training-free uncertainty estimator, and a temporal encoder to enhance robustness and efficiency in dynamic scenes.
Findings
Outperforms state-of-the-art in localization accuracy by 59.8%.
Improves rendering quality with a 4.57 dB PSNR increase.
Maintains real-time performance while handling dynamic environments.
Abstract
Recent 3D Gaussian Splatting (3DGS) techniques for Visual Simultaneous Localization and Mapping (SLAM) have significantly progressed in tracking and high-fidelity mapping. However, their sequential optimization framework and sensitivity to dynamic objects limit real-time performance and robustness in real-world scenarios. We present UP-SLAM, a real-time RGB-D SLAM system for dynamic environments that decouples tracking and mapping through a parallelized framework. A probabilistic octree is employed to manage Gaussian primitives adaptively, enabling efficient initialization and pruning without hand-crafted thresholds. To robustly filter dynamic regions during tracking, we propose a training-free uncertainty estimator that fuses multi-modal residuals to estimate per-pixel motion uncertainty, achieving open-set dynamic object handling without reliance on semantic labels. Furthermore, a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Robot Manipulation and Learning
MethodsAttention Is All You Need · Linear Layer · Softmax · Multi-Head Attention · Layer Normalization · Residual Connection · Dense Connections · Vision Transformer · Pruning · self-DIstillation with NO labels
