MoPA: Multi-Modal Prior Aided Domain Adaptation for 3D Semantic   Segmentation

Haozhi Cao; Yuecong Xu; Jianfei Yang; Pengyu Yin; Shenghai Yuan; Lihua; Xie

arXiv:2309.11839·cs.CV·September 22, 2023

MoPA: Multi-Modal Prior Aided Domain Adaptation for 3D Semantic Segmentation

Haozhi Cao, Yuecong Xu, Jianfei Yang, Pengyu Yin, Shenghai Yuan, Lihua, Xie

PDF

Open Access 1 Repo

TL;DR

This paper introduces MoPA, a novel multi-modal domain adaptation method that enhances 3D semantic segmentation, especially for rare objects, by using prior object insertion and 2D semantic masks to address class imbalance.

Contribution

MoPA proposes Valid Ground-based Insertion and SAM consistency loss to improve rare object segmentation in multi-modal unsupervised domain adaptation.

Findings

01

Achieves state-of-the-art results on MM-UDA benchmark.

02

Effectively improves rare object segmentation performance.

03

Utilizes prior object insertion and 2D semantic masks for better adaptation.

Abstract

Multi-modal unsupervised domain adaptation (MM-UDA) for 3D semantic segmentation is a practical solution to embed semantic understanding in autonomous systems without expensive point-wise annotations. While previous MM-UDA methods can achieve overall improvement, they suffer from significant class-imbalanced performance, restricting their adoption in real applications. This imbalanced performance is mainly caused by: 1) self-training with imbalanced data and 2) the lack of pixel-wise 2D supervision signals. In this work, we propose Multi-modal Prior Aided (MoPA) domain adaptation to improve the performance of rare objects. Specifically, we develop Valid Ground-based Insertion (VGI) to rectify the imbalance supervision signals by inserting prior rare objects collected from the wild while avoiding introducing artificial artifacts that lead to trivial solutions. Meanwhile, our SAM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

aroncao49/mopa
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications

MethodsSegment Anything Model