ACM-UNet: Adaptive Integration of CNNs and Mamba for Efficient Medical Image Segmentation
Jing Huang, Yongkang Zhao, Yuhan Li, Zhitao Dai, Cheng Chen, Qiying Lai

TL;DR
ACM-UNet is a versatile medical image segmentation framework that effectively combines pretrained CNNs and Mamba SSMs through a lightweight adapter, achieving state-of-the-art results with high efficiency.
Contribution
It introduces a novel adapter mechanism to integrate CNNs and Mamba models into a UNet-like architecture for improved segmentation performance.
Findings
Achieves 85.12% Dice Score on Synapse dataset.
Reaches 13.89mm HD95 with 17.93G FLOPs.
Outperforms existing methods on benchmark datasets.
Abstract
The U-shaped encoder-decoder architecture with skip connections has become a prevailing paradigm in medical image segmentation due to its simplicity and effectiveness. While many recent works aim to improve this framework by designing more powerful encoders and decoders, employing advanced convolutional neural networks (CNNs) for local feature extraction, Transformers or state space models (SSMs) such as Mamba for global context modeling, or hybrid combinations of both, these methods often struggle to fully utilize pretrained vision backbones (e.g., ResNet, ViT, VMamba) due to structural mismatches. To bridge this gap, we introduce ACM-UNet, a general-purpose segmentation framework that retains a simple UNet-like design while effectively incorporating pretrained CNNs and Mamba models through a lightweight adapter mechanism. This adapter resolves architectural incompatibilities and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · AI in cancer detection · Medical Imaging and Analysis
MethodsAverage Pooling · Convolution · Kaiming Initialization · Adapter · Global Average Pooling · Max Pooling · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
