M3-TransUNet: Medical Image Segmentation Based on Spatial Prior Attention and Multi-Scale Gating
Zhigao Zeng, Jiale Xiao, Shengqiu Yi, Qiang Liu, Yanhui Zhu

TL;DR
This paper introduces M3-TransUNet, a new model for medical image segmentation that improves accuracy and reduces errors using advanced attention mechanisms.
Contribution
The novel M3-TransUNet architecture introduces three new modules for better multi-scale feature representation and spatial modeling in medical image segmentation.
Findings
M3-TransUNet outperforms recent TransUNet variants on the Synapse dataset with an average DSC of 82.79%.
The model significantly reduces the average HD95 from 19.74 mm to 10.21 mm on the Synapse dataset.
Extensive experiments on public datasets demonstrate state-of-the-art performance for medical image segmentation.
Abstract
Medical image segmentation presents substantial challenges arising from the diverse scales and morphological complexities of target anatomical structures. Although existing Transformer-based models excel at capturing global dependencies, they encounter critical bottlenecks in multi-scale feature representation, spatial relationship modeling, and cross-layer feature fusion. To address these limitations, we propose the M3-TransUNet architecture, which incorporates three key innovations: (1) MSGA (Multi-Scale Gate Attention) and MSSA (Multi-Scale Selective Attention) modules to enhance multi-scale feature representation; (2) ME-MSA (Manhattan Enhanced Multi-Head Self-Attention) to integrate spatial priors into self-attention computations, thereby overcoming spatial modeling deficiencies; and (3) MKGAG (Multi-kernel Gated Attention Gate) to optimize skip connections by precisely filtering…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Imaging and Analysis · COVID-19 diagnosis using AI
