Towards Robust Semantic Segmentation of Accident Scenes via Multi-Source   Mixed Sampling and Meta-Learning

Xinyu Luo; Jiaming Zhang; Kailun Yang; Alina Roitberg; Kunyu Peng,; Rainer Stiefelhagen

arXiv:2203.10395·cs.CV·March 22, 2022

Towards Robust Semantic Segmentation of Accident Scenes via Multi-Source Mixed Sampling and Meta-Learning

Xinyu Luo, Jiaming Zhang, Kailun Yang, Alina Roitberg, Kunyu Peng,, Rainer Stiefelhagen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel multi-source meta-learning framework with mixed sampling and advanced decoding techniques to improve the robustness of semantic segmentation models in accident scenarios, significantly outperforming previous methods.

Contribution

The paper proposes MMUDA, a multi-source meta-learning unsupervised domain adaptation framework with hybrid decoding, enhancing segmentation robustness in abnormal accident scenes.

Findings

01

Achieves 46.97% mIoU on DADA-seg benchmark, surpassing previous state-of-the-art by 7.50%.

02

Utilizes multi-source mixed sampling to augment training data with abnormal scene appearances.

03

Incorporates a hybrid decoder with large window attention and strip pooling for better contextual understanding.

Abstract

Autonomous vehicles utilize urban scene segmentation to understand the real world like a human and react accordingly. Semantic segmentation of normal scenes has experienced a remarkable rise in accuracy on conventional benchmarks. However, a significant portion of real-life accidents features abnormal scenes, such as those with object deformations, overturns, and unexpected traffic behaviors. Since even small mis-segmentation of driving scenes can lead to serious threats to human lives, the robustness of such models in accident scenarios is an extremely important factor in ensuring safety of intelligent transportation systems. In this paper, we propose a Multi-source Meta-learning Unsupervised Domain Adaptation (MMUDA) framework, to improve the generalization of segmentation transformers to extreme accident scenes. In MMUDA, we make use of Multi-Domain Mixed Sampling to augment the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xinyu-laura/mmuda
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications

MethodsSpatial Pyramid Pooling