Towards Semi-supervised Dual-modal Semantic Segmentation
Qiulei Dong, Jianan Li, and Shuang Deng

TL;DR
This paper introduces PD-Net, a semi-supervised dual-modal segmentation network that effectively leverages limited labeled data and abundant unlabeled data from 3D point clouds and 2D images, improving segmentation performance.
Contribution
The paper proposes a novel parallel dual-stream network with pseudo-label prediction and optimization modules for semi-supervised dual-modal semantic segmentation.
Findings
Outperforms existing semi-supervised methods on public datasets.
Achieves competitive results with fully-supervised methods.
Effectively utilizes unlabeled data to enhance segmentation accuracy.
Abstract
With the development of 3D and 2D data acquisition techniques, it has become easy to obtain point clouds and images of scenes simultaneously, which further facilitates dual-modal semantic segmentation. Most existing methods for simultaneously segmenting point clouds and images rely heavily on the quantity and quality of the labeled training data. However, massive point-wise and pixel-wise labeling procedures are time-consuming and labor-intensive. To address this issue, we propose a parallel dual-stream network to handle the semi-supervised dual-modal semantic segmentation task, called PD-Net, by jointly utilizing a small number of labeled point clouds, a large number of unlabeled point clouds, and unlabeled images. The proposed PD-Net consists of two parallel streams (called original stream and pseudo-label prediction stream). The pseudo-label prediction stream predicts the pseudo…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
