Cross-modal & Cross-domain Learning for Unsupervised LiDAR Semantic Segmentation
Yiyang Chen, Shanshan Zhao, Changxing Ding, Liyao Tang, Chaoyue Wang,, Dacheng Tao

TL;DR
This paper introduces CoMoDaL, a novel method for unsupervised LiDAR semantic segmentation that leverages cross-modal and cross-domain learning to eliminate the need for source 3D data, using only 2D images with annotations and unannotated paired data.
Contribution
It proposes a new setting for 3D LiDAR segmentation that removes the requirement for source 3D data and introduces a cross-modal and cross-domain learning framework.
Findings
Achieves LiDAR segmentation without source 3D supervision
Effective cross-modal distillation between 2D images and 3D LiDAR data
Outperforms baseline methods in unsupervised setting
Abstract
In recent years, cross-modal domain adaptation has been studied on the paired 2D image and 3D LiDAR data to ease the labeling costs for 3D LiDAR semantic segmentation (3DLSS) in the target domain. However, in such a setting the paired 2D and 3D data in the source domain are still collected with additional effort. Since the 2D-3D projections can enable the 3D model to learn semantic information from the 2D counterpart, we ask whether we could further remove the need of source 3D data and only rely on the source 2D images. To answer it, this paper studies a new 3DLSS setting where a 2D dataset (source) with semantic annotations and a paired but unannotated 2D image and 3D LiDAR data (target) are available. To achieve 3DLSS in this scenario, we propose Cross-Modal and Cross-Domain Learning (CoMoDaL). Specifically, our CoMoDaL aims at modeling 1) inter-modal cross-domain distillation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
