Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without   Manual Labels

Rui Huang; Songyou Peng; Ayca Takmaz; Federico Tombari; Marc; Pollefeys; Shiji Song; Gao Huang; Francis Engelmann

arXiv:2312.17232·cs.CV·December 29, 2023·2 cites

Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels

Rui Huang, Songyou Peng, Ayca Takmaz, Federico Tombari, Marc, Pollefeys, Shiji Song, Gao Huang, Francis Engelmann

PDF

Open Access

TL;DR

Segment3D leverages 2D foundation models to generate automatic, high-quality 3D segmentation masks, enabling fine-grained, class-agnostic 3D scene segmentation without manual labels, thus improving generalization and scalability.

Contribution

It introduces a novel approach that uses 2D foundation models to automatically generate training labels for 3D segmentation, eliminating manual annotation.

Findings

01

Outperforms existing 3D segmentation models on fine-grained masks.

02

Enables easy addition of new training data without manual labels.

03

Improves generalization to unseen object classes.

Abstract

Current 3D scene segmentation methods are heavily dependent on manually annotated 3D training datasets. Such manual annotations are labor-intensive, and often lack fine-grained details. Importantly, models trained on this data typically struggle to recognize object classes beyond the annotated classes, i.e., they do not generalize well to unseen domains and require additional domain-specific annotations. In contrast, 2D foundation models demonstrate strong generalization and impressive zero-shot abilities, inspiring us to incorporate these characteristics from 2D models into 3D models. Therefore, we explore the use of image segmentation foundation models to automatically generate training labels for 3D segmentation. We propose Segment3D, a method for class-agnostic 3D scene segmentation that produces high-quality 3D segmentation masks. It improves over existing 3D segmentation models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Infrastructure Maintenance and Monitoring