SMAFormer: Synergistic Multi-Attention Transformer for Medical Image Segmentation

Fuchen Zheng; Xuhang Chen; Weihuang Liu; Haolun Li; Yingtie Lei; Jiahui He; Chi-Man Pun; Shounjun Zhou

arXiv:2409.00346·cs.CV·July 31, 2025·2 cites

SMAFormer: Synergistic Multi-Attention Transformer for Medical Image Segmentation

Fuchen Zheng, Xuhang Chen, Weihuang Liu, Haolun Li, Yingtie Lei, Jiahui He, Chi-Man Pun, Shounjun Zhou

PDF

Open Access 2 Repos 1 Models

TL;DR

SMAFormer is a novel Transformer-based architecture that combines multiple attention mechanisms and a feature fusion module to improve the segmentation of small, irregularly shaped tumors in medical images, achieving state-of-the-art results.

Contribution

The paper introduces SMAFormer, a new architecture that synergistically integrates pixel, channel, and spatial attention with a feature fusion modulator for enhanced medical image segmentation.

Findings

01

Achieves state-of-the-art performance on multiple segmentation tasks.

02

Effectively captures local and global features for small tumor segmentation.

03

Demonstrates robustness across diverse medical imaging datasets.

Abstract

In medical image segmentation, specialized computer vision techniques, notably transformers grounded in attention mechanisms and residual networks employing skip connections, have been instrumental in advancing performance. Nonetheless, previous models often falter when segmenting small, irregularly shaped tumors. To this end, we introduce SMAFormer, an efficient, Transformer-based architecture that fuses multiple attention mechanisms for enhanced segmentation of small tumors and organs. SMAFormer can capture both local and global features for medical image segmentation. The architecture comprises two pivotal components. First, a Synergistic Multi-Attention (SMA) Transformer block is proposed, which has the benefits of Pixel Attention, Channel Attention, and Spatial Attention for feature enrichment. Second, addressing the challenge of information loss incurred during attention mechanism…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
lzeeorno666/SMAFormer-Medical-Image-Segmentation
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBrain Tumor Detection and Classification

MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Layer Normalization · Dropout · Position-Wise Feed-Forward Layer · Residual Connection · Linear Layer