High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation

Lunhao Duan; Shanshan Zhao; Xingxing Weng; Jing Zhang; Gui-Song Xia

arXiv:2506.23227·cs.CV·July 1, 2025

High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation

Lunhao Duan, Shanshan Zhao, Xingxing Weng, Jing Zhang, Gui-Song Xia

PDF

TL;DR

This paper introduces a novel framework for indoor point cloud segmentation using scene-level annotations, combining multi-modal features and semantic consistency to generate high-quality pseudo-labels and improve segmentation accuracy.

Contribution

It proposes a new pseudo-label generation method leveraging cross-modal guidance and region-point semantic consistency for scene-level annotated point cloud segmentation.

Findings

01

Significant performance improvements on ScanNet v2 and S3DIS datasets.

02

Effective pseudo-label refinement during training.

03

Validation of individual component contributions through ablation studies.

Abstract

This paper investigates indoor point cloud semantic segmentation under scene-level annotation, which is less explored compared to methods relying on sparse point-level labels. In the absence of precise point-level labels, current methods first generate point-level pseudo-labels, which are then used to train segmentation models. However, generating accurate pseudo-labels for each point solely based on scene-level annotations poses a considerable challenge, substantially affecting segmentation performance. Consequently, to enhance accuracy, this paper proposes a high-quality pseudo-label generation framework by exploring contemporary multi-modal information and region-point semantic consistency. Specifically, with a cross-modal feature guidance module, our method utilizes 2D-3D correspondences to align point cloud features with corresponding 2D image pixels, thereby assisting point cloud…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.