IBAFormer: Intra-batch Attention Transformer for Domain Generalized   Semantic Segmentation

Qiyu Sun; Huilin Chen; Meng Zheng; Ziyan Wu; Michael Felsberg; Yang; Tang

arXiv:2309.06282·cs.CV·September 13, 2023·1 cites

IBAFormer: Intra-batch Attention Transformer for Domain Generalized Semantic Segmentation

Qiyu Sun, Huilin Chen, Meng Zheng, Ziyan Wu, Michael Felsberg, Yang, Tang

PDF

Open Access

TL;DR

IBAFormer introduces intra-batch attention mechanisms to Transformer models, leveraging inter-sample correlations to significantly improve domain generalization in semantic segmentation tasks.

Contribution

The paper proposes novel intra-batch attention modules, MIBA and EIBA, and integrates them into Transformer networks, advancing DGSS performance beyond existing methods.

Findings

01

Achieves state-of-the-art results in DGSS

02

Demonstrates effectiveness of intra-batch attention modules

03

Shows improved generalization over CNN-based models

Abstract

Domain generalized semantic segmentation (DGSS) is a critical yet challenging task, where the model is trained only on source data without access to any target data. Despite the proposal of numerous DGSS strategies, the generalization capability remains limited in CNN architectures. Though some Transformer-based segmentation models show promising performance, they primarily focus on capturing intra-sample attentive relationships, disregarding inter-sample correlations which can potentially benefit DGSS. To this end, we enhance the attention modules in Transformer networks for improving DGSS by incorporating information from other independent samples in the same batch, enriching contextual information, and diversifying the training data for each attention block. Specifically, we propose two alternative intra-batch attention mechanisms, namely mean-based intra-batch attention (MIBA) and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Medical Imaging and Analysis

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Adam · Byte Pair Encoding · Softmax · Dropout · Label Smoothing · Absolute Position Encodings