TL;DR
This paper introduces a novel geometry-only supervision framework for open-vocabulary 3D occupancy prediction in indoor scenes, leveraging Gaussian representations and a progressive sharpening technique to improve semantic and geometric understanding.
Contribution
It proposes a new Gaussian-based approach with a stabilization method and a progressive semantic sharpening schedule for better indoor occupancy prediction.
Findings
Achieves 59.50 IoU and 21.05 mIoU on Occ-ScanNet in open-vocabulary setting
Surpasses existing occupancy methods in IoU and prior open-vocabulary approaches in mIoU
Introduces a stabilization technique and a semantic sharpening schedule that improve indoor scene understanding
Abstract
Open-vocabulary 3D occupancy is vital for embodied agents, which need to understand complex indoor environments where semantic categories are abundant and evolve beyond fixed taxonomies. While recent work has explored open-vocabulary occupancy in outdoor driving scenarios, such methods transfer poorly indoors, where geometry is denser, layouts are more intricate, and semantics are far more fine-grained. To address these challenges, we adopt a geometry-only supervision paradigm that uses only binary occupancy labels (occupied vs free). Our framework builds upon 3D Language-Embedded Gaussians, which serve as a unified intermediate representation coupling fine-grained 3D geometry with a language-aligned semantic embedding. On the geometry side, we find that existing Gaussian-to-Occupancy operators fail to converge under such weak supervision, and we introduce an opacity-aware,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization · 3D Modeling in Geospatial Applications
