DAE-Former: Dual Attention-guided Efficient Transformer for Medical Image Segmentation
Reza Azad, Ren\'e Arimond, Ehsan Khodapanah Aghdam, Amirhossein, Kazerouni, Dorit Merhof

TL;DR
DAE-Former introduces an efficient dual attention mechanism for medical image segmentation, capturing spatial and channel relations with reduced computational cost, outperforming existing methods without pre-training.
Contribution
It reformulates self-attention to be more efficient and integrates cross-attention in skip connections, enhancing segmentation accuracy in medical images.
Findings
Outperforms state-of-the-art on multi-organ cardiac segmentation
Achieves superior results on skin lesion segmentation
Operates efficiently without pre-training
Abstract
Transformers have recently gained attention in the computer vision domain due to their ability to model long-range dependencies. However, the self-attention mechanism, which is the core part of the Transformer model, usually suffers from quadratic computational complexity with respect to the number of tokens. Many architectures attempt to reduce model complexity by limiting the self-attention mechanism to local regions or by redesigning the tokenization process. In this paper, we propose DAE-Former, a novel method that seeks to provide an alternative perspective by efficiently designing the self-attention mechanism. More specifically, we reformulate the self-attention mechanism to capture both spatial and channel relations across the whole feature dimension while staying computationally efficient. Furthermore, we redesign the skip connection path by including the cross-attention module…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Radiomics and Machine Learning in Medical Imaging · Medical Imaging and Analysis
MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Dense Connections · Linear Layer · Concatenated Skip Connection · Layer Normalization · Adam · Byte Pair Encoding · Residual Connection
