EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D   Medical Image Segmentation

Ao Chang; Jiajun Zeng; Ruobing Huang; and Dong Ni

arXiv:2409.17675·cs.CV·September 27, 2024

EM-Net: Efficient Channel and Frequency Learning with Mamba for 3D Medical Image Segmentation

Ao Chang, Jiajun Zeng, Ruobing Huang, and Dong Ni

PDF

Open Access 1 Repo

TL;DR

EM-Net introduces a novel Mamba-based approach for 3D medical image segmentation that improves accuracy, reduces model size, and accelerates training by leveraging channel and frequency domain learning.

Contribution

The paper presents a new Mamba-based model, EM-Net, that efficiently captures regional interactions and multi-scale features in 3D medical images, outperforming existing methods.

Findings

01

Achieves better segmentation accuracy than SOTA models.

02

Uses nearly half the parameters of comparable models.

03

Provides 2x faster training speed.

Abstract

Convolutional neural networks have primarily led 3D medical image segmentation but may be limited by small receptive fields. Transformer models excel in capturing global relationships through self-attention but are challenged by high computational costs at high resolutions. Recently, Mamba, a state space model, has emerged as an effective approach for sequential modeling. Inspired by its success, we introduce a novel Mamba-based 3D medical image segmentation model called EM-Net. It not only efficiently captures attentive interaction between regions by integrating and selecting channels, but also effectively utilizes frequency domain to harmonize the learning of features across varying scales, while accelerating training speed. Comprehensive experiments on two challenging multi-organ datasets with other state-of-the-art (SOTA) algorithms show that our method exhibits better segmentation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zang0902/EM-Net
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI in cancer detection · Advanced Data Compression Techniques · Image Retrieval and Classification Techniques

MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Softmax · Layer Normalization · Dropout · Dense Connections