Cross-modal Learning for Domain Adaptation in 3D Semantic Segmentation

Maximilian Jaritz; Tuan-Hung Vu; Raoul de Charette; \'Emilie Wirbel,; and Patrick P\'erez

arXiv:2101.07253·cs.CV·June 23, 2022

Cross-modal Learning for Domain Adaptation in 3D Semantic Segmentation

Maximilian Jaritz, Tuan-Hung Vu, Raoul de Charette, \'Emilie Wirbel,, and Patrick P\'erez

PDF

3 Repos

TL;DR

This paper introduces a cross-modal learning approach for domain adaptation in 3D semantic segmentation, leveraging multi-modal data to improve performance across various challenging scenarios.

Contribution

It proposes a novel cross-modal learning strategy that enforces consistency between modalities via mutual mimicking for domain adaptation.

Findings

01

Significant improvement over uni-modal baselines in multiple scenarios

02

Effective in unsupervised and semi-supervised settings

03

Robust across different domain shifts such as weather and sensor changes

Abstract

Domain adaptation is an important task to enable learning when labels are scarce. While most works focus only on the image modality, there are many important multi-modal datasets. In order to leverage multi-modality for domain adaptation, we propose cross-modal learning, where we enforce consistency between the predictions of two modalities via mutual mimicking. We constrain our network to make correct predictions on labeled data and consistent predictions across modalities on unlabeled target-domain data. Experiments in unsupervised and semi-supervised domain adaptation settings prove the effectiveness of this novel domain adaptation strategy. Specifically, we evaluate on the task of 3D semantic segmentation from either the 2D image, the 3D point cloud or from both. We leverage recent driving datasets to produce a wide variety of domain adaptation scenarios including changes in scene…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.