OpenMask3D: Open-Vocabulary 3D Instance Segmentation
Ayça Takmaz, Elisabetta Fedele, Robert W. Sumner, Marc Pollefeys, Federico Tombari, Francis Engelmann

TL;DR
OpenMask3D is a zero-shot method for open-vocabulary 3D instance segmentation that leverages class-agnostic masks and multi-view CLIP-based features to identify diverse objects beyond predefined categories.
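The querying idea in the summary above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes each class-agnostic mask already has one embedding per view, that per-view embeddings are combined by simple averaging, and that masks are ranked by cosine similarity to a text-query embedding. The function names and the toy 2-D vectors are hypothetical stand-ins for real CLIP image and text embeddings.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        return list(vec)
    return [x / norm for x in vec]


def aggregate_mask_feature(view_features):
    """Average the per-view embeddings of one mask, then normalize.

    Averaging is an assumption made for this sketch.
    """
    dim = len(view_features[0])
    count = len(view_features)
    mean = [sum(f[i] for f in view_features) / count for i in range(dim)]
    return l2_normalize(mean)


def rank_masks(mask_view_features, query_embedding):
    """Return (mask_id, cosine similarity) pairs, best match first."""
    query = l2_normalize(query_embedding)
    scores = []
    for mask_id, views in mask_view_features.items():
        feature = aggregate_mask_feature(views)
        similarity = sum(a * b for a, b in zip(feature, query))
        scores.append((mask_id, similarity))
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Toy data: two masks with hypothetical per-view embeddings.
mask_view_features = {
    "mask_a": [[1.0, 0.0], [0.9, 0.1]],
    "mask_b": [[0.0, 1.0]],
}
ranked = rank_masks(mask_view_features, query_embedding=[1.0, 0.0])
print(ranked[0][0])  # id of the mask most similar to the query
```

Because the features are attached to masks rather than to individual points, each query result is an object instance, which is the distinction the summary draws from per-point open-vocabulary methods.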
Contribution
The paper introduces the task of open-vocabulary 3D instance segmentation and proposes OpenMask3D, a zero-shot approach that handles free-form queries and long-tail object distributions and outperforms other open-vocabulary methods.
Findings
Outperforms other open-vocabulary methods on ScanNet200 and Replica datasets.
Effectively segments objects based on geometry, affordances, and materials.
Demonstrates strong zero-shot capabilities in 3D scene understanding.
Abstract
We introduce the task of open-vocabulary 3D instance segmentation. Current approaches for 3D instance segmentation can typically only recognize object categories from a pre-defined closed set of classes that are annotated in the training datasets. This results in important limitations for real-world applications where one might need to perform tasks guided by novel, open-vocabulary queries related to a wide variety of objects. Recently, open-vocabulary 3D scene understanding methods have emerged to address this problem by learning queryable features for each point in the scene. While such a representation can be directly employed to perform semantic segmentation, existing methods cannot separate multiple object instances. In this work, we address this limitation, and propose OpenMask3D, which is a zero-shot approach for open-vocabulary 3D instance segmentation. Guided by predicted…
Taxonomy
Topics: Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Medical Image Segmentation Techniques
