Towards Robust Semantic Segmentation of Accident Scenes via Multi-Source Mixed Sampling and Meta-Learning
Xinyu Luo, Jiaming Zhang, Kailun Yang, Alina Roitberg, Kunyu Peng,, Rainer Stiefelhagen

TL;DR
This paper introduces a novel multi-source meta-learning framework with mixed sampling and advanced decoding techniques to improve the robustness of semantic segmentation models in accident scenarios, significantly outperforming previous methods.
Contribution
The paper proposes MMUDA, a multi-source meta-learning unsupervised domain adaptation framework with hybrid decoding, enhancing segmentation robustness in abnormal accident scenes.
Findings
Achieves 46.97% mIoU on DADA-seg benchmark, surpassing previous state-of-the-art by 7.50%.
Utilizes multi-source mixed sampling to augment training data with abnormal scene appearances.
Incorporates a hybrid decoder with large window attention and strip pooling for better contextual understanding.
Abstract
Autonomous vehicles utilize urban scene segmentation to understand the real world like a human and react accordingly. Semantic segmentation of normal scenes has experienced a remarkable rise in accuracy on conventional benchmarks. However, a significant portion of real-life accidents features abnormal scenes, such as those with object deformations, overturns, and unexpected traffic behaviors. Since even small mis-segmentation of driving scenes can lead to serious threats to human lives, the robustness of such models in accident scenarios is an extremely important factor in ensuring safety of intelligent transportation systems. In this paper, we propose a Multi-source Meta-learning Unsupervised Domain Adaptation (MMUDA) framework, to improve the generalization of segmentation transformers to extreme accident scenes. In MMUDA, we make use of Multi-Domain Mixed Sampling to augment the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications
MethodsSpatial Pyramid Pooling
