xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic   Segmentation

Maximilian Jaritz; Tuan-Hung Vu; Raoul de Charette; \'Emilie Wirbel,; Patrick P\'erez

arXiv:1911.12676·cs.CV·April 1, 2020

xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation

Maximilian Jaritz, Tuan-Hung Vu, Raoul de Charette, \'Emilie Wirbel,, Patrick P\'erez

PDF

Open Access 1 Repo 1 Video

TL;DR

xMUDA introduces a novel cross-modal unsupervised domain adaptation method for 3D semantic segmentation that leverages mutual learning between 2D images and 3D point clouds, significantly improving performance across various domain shifts.

Contribution

The paper proposes a cross-modal UDA framework that enables 2D and 3D modalities to learn from each other through mutual mimicking, addressing heterogeneity and domain shift challenges.

Findings

01

xMUDA outperforms uni-modal UDA methods on multiple domain shift scenarios.

02

Mutual mimicking improves segmentation accuracy across modalities.

03

The approach is complementary to existing state-of-the-art UDA techniques.

Abstract

Unsupervised Domain Adaptation (UDA) is crucial to tackle the lack of annotations in a new domain. There are many multi-modal datasets, but most UDA approaches are uni-modal. In this work, we explore how to learn from multi-modality and propose cross-modal UDA (xMUDA) where we assume the presence of 2D images and 3D point clouds for 3D semantic segmentation. This is challenging as the two input spaces are heterogeneous and can be impacted differently by domain shift. In xMUDA, modalities learn from each other through mutual mimicking, disentangled from the segmentation objective, to prevent the stronger modality from adopting false predictions from the weaker one. We evaluate on new UDA scenarios including day-to-night, country-to-country and dataset-to-dataset, leveraging recent autonomous driving datasets. xMUDA brings large improvements over uni-modal UDA on all tested scenarios, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

valeoai/xmuda
pytorchOfficial

Videos

xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation· youtube

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research