TL;DR
GS3LAM introduces a real-time, dense semantic SLAM framework using Gaussian Splatting that effectively fuses multimodal data for improved mapping and localization.
Contribution
It proposes a novel Gaussian Semantic Splatting approach with joint optimization, depth-adaptive regularization, and a keyframe mapping strategy, advancing semantic SLAM capabilities.
Findings
Achieves higher tracking robustness and semantic accuracy.
Provides superior rendering quality over existing methods.
Demonstrates real-time performance on benchmark datasets.
Abstract
Recently, the multi-modal fusion of RGB, depth, and semantics has shown great potential in dense Simultaneous Localization and Mapping (SLAM). However, a prerequisite for generating consistent semantic maps is the availability of dense, efficient, and scalable scene representations. Existing semantic SLAM systems based on explicit representations are often limited by resolution and an inability to predict unknown areas. Conversely, implicit representations typically rely on time-consuming ray tracing, failing to meet real-time requirements. Fortunately, 3D Gaussian Splatting (3DGS) has emerged as a promising representation that combines the efficiency of point-based methods with the continuity of geometric structures. To this end, we propose GS3LAM, a Gaussian Semantic Splatting SLAM framework that processes multimodal data to render consistent, dense semantic maps in real-time. GS3LAM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
