Augmentation-based Domain Generalization for Semantic Segmentation
Manuel Schwonberg, Fadoua El Bouazati, Nico M. Schmidt, Hanno, Gottschalk

TL;DR
This paper systematically evaluates simple image augmentations for domain generalization in semantic segmentation, showing that combined augmentations can rival complex methods without additional training overhead.
Contribution
It provides a comprehensive statistical analysis of simple augmentation techniques and demonstrates their competitive performance against state-of-the-art methods.
Findings
Combined augmentations outperform single augmentations.
Simple augmentations perform competitively with complex DG methods.
Using vision transformers further improves segmentation performance.
Abstract
Unsupervised Domain Adaptation (UDA) and domain generalization (DG) are two research areas that aim to tackle the lack of generalization of Deep Neural Networks (DNNs) towards unseen domains. While UDA methods have access to unlabeled target images, domain generalization does not involve any target data and only learns generalized features from a source domain. Image-style randomization or augmentation is a popular approach to improve network generalization without access to the target domain. Complex methods are often proposed that disregard the potential of simple image augmentations for out-of-domain generalization. For this reason, we systematically study the in- and out-of-domain generalization capabilities of simple, rule-based image augmentations like blur, noise, color jitter and many more. Based on a full factorial design of experiment design we provide a systematic statistical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and ELM
MethodsMulti-Head Attention · Attention Is All You Need · Residual Connection · Softmax · Linear Layer · Layer Normalization · Dense Connections · Vision Transformer
