Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation
Hao Tang, Lianglun Cheng, Guoheng Huang, Zhengguang Tan, Junhao Lu and, Kaihong Wu

TL;DR
This paper introduces TM-UNet, a novel medical image segmentation model that combines residual VSS Blocks and Triplet SSM to improve feature extraction and reduce parameters, outperforming previous models on multiple datasets.
Contribution
The paper proposes Triplet Mamba-UNet, integrating residual VSS Blocks and Triplet SSM for enhanced feature fusion and parameter efficiency in medical image segmentation.
Findings
Superior segmentation performance on multiple datasets.
One-third reduction in model parameters compared to VM-UNet.
Effective fusion of spatial and channel features.
Abstract
Image segmentation holds a vital position in the realms of diagnosis and treatment within the medical domain. Traditional convolutional neural networks (CNNs) and Transformer models have made significant advancements in this realm, but they still encounter challenges because of limited receptive field or high computing complexity. Recently, State Space Models (SSMs), particularly Mamba and its variants, have demonstrated notable performance in the field of vision. However, their feature extraction methods may not be sufficiently effective and retain some redundant structures, leaving room for parameter reduction. Motivated by previous spatial and channel attention methods, we propose Triplet Mamba-UNet. The method leverages residual VSS Blocks to extract intensive contextual features, while Triplet SSM is employed to fuse features across spatial and channel dimensions. We conducted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Advanced Neural Network Applications · Brain Tumor Detection and Classification
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Byte Pair Encoding · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Multi-Head Attention · Softmax · Dropout
