OVO: Open-Vocabulary Occupancy
Zhiyu Tan, Zichao Dong, Cheng Zhang, Weikun Zhang, Hang Ji, Hao Li

TL;DR
This paper introduces OVO, a novel method enabling semantic occupancy prediction for arbitrary classes without requiring 3D annotations during training, by leveraging knowledge distillation from 2D models and pixel-voxel filtering.
Contribution
OVO is the first approach to perform open-vocabulary semantic occupancy prediction without 3D annotations, using knowledge distillation and filtering techniques.
Findings
Achieves competitive results on NYUv2 and SemanticKITTI datasets.
Compatible with most state-of-the-art semantic occupancy models.
Provides extensive analysis and ablation studies.
Abstract
Semantic occupancy prediction aims to infer dense geometry and semantics of surroundings for an autonomous agent to operate safely in the 3D environment. Existing occupancy prediction methods are almost entirely trained on human-annotated volumetric data. Although of high quality, the generation of such 3D annotations is laborious and costly, restricting them to a few specific object categories in the training dataset. To address this limitation, this paper proposes Open Vocabulary Occupancy (OVO), a novel approach that allows semantic occupancy prediction of arbitrary classes but without the need for 3D annotations during training. Keys to our approach are (1) knowledge distillation from a pre-trained 2D open-vocabulary segmentation model to the 3D occupancy network, and (2) pixel-voxel filtering for high-quality training data generation. The resulting framework is simple, compact, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Multimodal Machine Learning Applications
MethodsKnowledge Distillation
