OnlinePG: Online Open-Vocabulary Panoptic Mapping with 3D Gaussian Splatting
Hongjia Zhai, Qi Zhang, Xiaokun Pan, Xiyu Zhang, Yitong Dong, Huaqi Zhang, Dan Xu, Guofeng Zhang

TL;DR
OnlinePG introduces an online, open-vocabulary panoptic mapping system that combines geometric reconstruction and semantic understanding using 3D Gaussian Splatting, enabling real-time scene perception for robotics.
Contribution
It is the first online system integrating 3D Gaussian Splatting with open-vocabulary perception for panoptic mapping in real-time.
Findings
Outperforms existing online methods in accuracy.
Maintains real-time processing speed.
Effectively fuses geometric and semantic cues.
Abstract
Open-vocabulary scene understanding with online panoptic mapping is essential for embodied applications to perceive and interact with environments. However, existing methods are predominantly offline or lack instance-level understanding, limiting their applicability to real-world robotic tasks. In this paper, we propose OnlinePG, a novel and effective system that integrates geometric reconstruction and open-vocabulary perception using 3D Gaussian Splatting in an online setting. Technically, to achieve online panoptic mapping, we employ an efficient local-to-global paradigm with a sliding window. To build local consistency map, we construct a 3D segment clustering graph that jointly leverages geometric and semantic cues, fusing inconsistent segments within sliding window into complete instances. Subsequently, to update the global map, we construct explicit grids with spatial attributes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Multimodal Machine Learning Applications · 3D Shape Modeling and Analysis
