Data-Efficient Semantic Segmentation of 3D Point Clouds via Open-Vocabulary Image Segmentation-based Pseudo-Labeling

Takahiko Furuya

arXiv:2604.11007·cs.CV·April 14, 2026

Data-Efficient Semantic Segmentation of 3D Point Clouds via Open-Vocabulary Image Segmentation-based Pseudo-Labeling

Takahiko Furuya

PDF

TL;DR

This paper introduces PLOVIS, a novel data-efficient training framework for 3D point cloud semantic segmentation that leverages open-vocabulary image segmentation for pseudo-labeling, addressing data scarcity issues.

Contribution

PLOVIS uniquely combines pseudo-labeling from open-vocabulary models with a two-stage filtering and class-balanced memory bank to improve segmentation under limited data conditions.

Findings

01

PLOVIS outperforms existing methods on multiple benchmarks.

02

It effectively handles scarce training data and limited annotations.

03

The approach reduces pseudo-label noise and class imbalance.

Abstract

Semantic segmentation of 3D point cloud scenes is a crucial task for various applications. In real-world scenarios, training segmentation models often faces three concurrent forms of data insufficiency: scarcity of training scenes, scarcity of point-level annotations, and absence of 2D image sequences from which point clouds were reconstructed. Existing data-efficient algorithms typically address only one or two of these challenges, leaving the joint treatment of all three unexplored. This paper proposes a data-efficient training framework specifically designed to address the three forms of data insufficiency. Our proposed algorithm, called Point pseudo-Labeling via Open-Vocabulary Image Segmentation (PLOVIS), leverages an Open-Vocabulary Image Segmentation (OVIS) model as a pseudo label generator to compensate for the lack of training data. PLOVIS creates 2D images for pseudo-labeling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.