Cross-modal & Cross-domain Learning for Unsupervised LiDAR Semantic   Segmentation

Yiyang Chen; Shanshan Zhao; Changxing Ding; Liyao Tang; Chaoyue Wang,; Dacheng Tao

arXiv:2308.02883·cs.CV·August 8, 2023

Cross-modal & Cross-domain Learning for Unsupervised LiDAR Semantic Segmentation

Yiyang Chen, Shanshan Zhao, Changxing Ding, Liyao Tang, Chaoyue Wang,, Dacheng Tao

PDF

Open Access

TL;DR

This paper introduces CoMoDaL, a novel method for unsupervised LiDAR semantic segmentation that leverages cross-modal and cross-domain learning to eliminate the need for source 3D data, using only 2D images with annotations and unannotated paired data.

Contribution

It proposes a new setting for 3D LiDAR segmentation that removes the requirement for source 3D data and introduces a cross-modal and cross-domain learning framework.

Findings

01

Achieves LiDAR segmentation without source 3D supervision

02

Effective cross-modal distillation between 2D images and 3D LiDAR data

03

Outperforms baseline methods in unsupervised setting

Abstract

In recent years, cross-modal domain adaptation has been studied on the paired 2D image and 3D LiDAR data to ease the labeling costs for 3D LiDAR semantic segmentation (3DLSS) in the target domain. However, in such a setting the paired 2D and 3D data in the source domain are still collected with additional effort. Since the 2D-3D projections can enable the 3D model to learn semantic information from the 2D counterpart, we ask whether we could further remove the need of source 3D data and only rely on the source 2D images. To answer it, this paper studies a new 3DLSS setting where a 2D dataset (source) with semantic annotations and a paired but unannotated 2D image and 3D LiDAR data (target) are available. To achieve 3DLSS in this scenario, we propose Cross-Modal and Cross-Domain Learning (CoMoDaL). Specifically, our CoMoDaL aims at modeling 1) inter-modal cross-domain distillation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications