Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting
Boying Li, Vuong Chi Hao, Peter J. Stuckey, Ian Reid, Hamid Rezatofighi

TL;DR
Hier-SLAM++ introduces a neuro-symbolic hierarchical semantic 3D Gaussian Splatting SLAM system that effectively combines semantic and geometric information, enabling accurate pose estimation and 3D mapping with reduced sensor and computational requirements.
Contribution
It presents a novel hierarchical representation for semantic SLAM, integrating large language models and 3D generative models for end-to-end learning, and is the first monocular Gaussian Splatting SLAM system.
Findings
Achieves superior or comparable performance to state-of-the-art methods.
Reduces storage and training time significantly.
Supports both RGB-D and monocular inputs effectively.
Abstract
We propose Hier-SLAM++, a comprehensive Neuro-Symbolic semantic 3D Gaussian Splatting SLAM method with both RGB-D and monocular input featuring an advanced hierarchical categorical representation, which enables accurate pose estimation as well as global 3D semantic mapping. The parameter usage in semantic SLAM systems increases significantly with the growing complexity of the environment, making scene understanding particularly challenging and costly. To address this problem, we introduce a novel hierarchical representation that encodes both semantic and geometric information in a compact form into 3D Gaussian Splatting, leveraging the capabilities of large language models (LLMs) as well as the 3D generative model. By utilizing the proposed hierarchical tree structure, semantic information is symbolically represented and learned in an end-to-end manner. We further introduce an advanced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
