Class-agnostic 3D Segmentation by Granularity-Consistent Automatic 2D Mask Tracking
Juan Wang, Yasutomo Kawanishi, Tomo Miyazaki, Zhijie Wang, Shinichiro Omachi

TL;DR
This paper introduces a novel 3D segmentation method that maintains temporal consistency in 2D mask tracking and employs curriculum learning to improve 3D pseudo-label quality, achieving state-of-the-art results.
Contribution
It proposes a granularity-consistent 2D mask tracking technique combined with a three-stage curriculum learning framework for improved 3D segmentation.
Findings
Achieved state-of-the-art performance on standard benchmarks.
Generated more consistent and accurate 3D segmentations.
Demonstrated robustness in open-vocabulary scenarios.
Abstract
3D instance segmentation is an important task for real-world applications. To avoid costly manual annotations, existing methods have explored generating pseudo labels by transferring 2D masks from foundation models to 3D. However, this approach is often suboptimal since the video frames are processed independently. This causes inconsistent segmentation granularity and conflicting 3D pseudo labels, which degrades the accuracy of final segmentation. To address this, we introduce a Granularity-Consistent automatic 2D Mask Tracking approach that maintains temporal correspondences across frames, eliminating conflicting pseudo labels. Combined with a three-stage curriculum learning framework, our approach progressively trains from fragmented single-view data to unified multi-view annotations, ultimately globally coherent full-scene supervision. This structured learning pipeline enables the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Neural Network Applications · Medical Image Segmentation Techniques
