Zero-shot point cloud segmentation by transferring geometric primitives
Runnan Chen, Xinge Zhu, Nenglun Chen, Wei Li, Yuexin Ma, Ruigang Yang,, Wenping Wang

TL;DR
This paper introduces a zero-shot point cloud segmentation method that leverages geometric primitives and language alignment to recognize unseen objects, significantly outperforming existing approaches across multiple datasets.
Contribution
It proposes a novel framework that learns shared geometric primitives and aligns them with language, enabling zero-shot segmentation of unseen 3D objects.
Findings
Achieves up to 30.4% improvement in harmonic mean-IoU on ScanNet.
Outperforms state-of-the-art methods on S3DIS, SemanticKITTI, and nuScenes datasets.
Introduces a new Unknown-aware InfoNCE Loss for fine-grained language-visual alignment.
Abstract
We investigate transductive zero-shot point cloud semantic segmentation, where the network is trained on seen objects and able to segment unseen objects. The 3D geometric elements are essential cues to imply a novel 3D object type. However, previous methods neglect the fine-grained relationship between the language and the 3D geometric elements. To this end, we propose a novel framework to learn the geometric primitives shared in seen and unseen categories' objects and employ a fine-grained alignment between language and the learned geometric primitives. Therefore, guided by language, the network recognizes the novel objects represented with geometric primitives. Specifically, we formulate a novel point visual representation, the similarity vector of the point's feature to the learnable prototypes, where the prototypes automatically encode geometric primitives via back-propagation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Neural Network Applications · 3D Surveying and Cultural Heritage
MethodsALIGN · InfoNCE
