Patch-MoE Mamba: A Patch-Ordered Mixture-of-Experts State Space Architecture for Medical Image Segmentation
Diego Adame, Fabian Vazquez, Jose A. Nunez, Huimin Li, Jinghao Yang, Erik Enriquez, DongChul Kim, Haoteng Tang, Bin Fu, Pengfei Gu

TL;DR
Patch-MoE Mamba introduces a hierarchical patch-ordered scanning and adaptive fusion mechanism in a state space architecture, improving medical image segmentation by capturing multi-scale context and local spatial structure.
Contribution
It proposes a novel patch-ordered mixture-of-experts state space architecture that addresses limitations of existing models in preserving spatial structure and adaptive feature fusion.
Findings
Outperforms existing methods on five polyp segmentation benchmarks.
Achieves superior results on ISIC 2017/2018 skin lesion datasets.
Demonstrates generality and effectiveness across multiple medical imaging tasks.
Abstract
CNN- and Transformer-based architectures have achieved strong performance in medical image segmentation, but CNNs are limited in modeling long-range dependencies, while Transformers often suffer from quadratic computational and memory complexity. State space models, especially Mamba-based networks, offer an efficient alternative with linear sequence complexity. However, existing Mamba segmentation models still face two limitations: pixel-wise directional scanning can disrupt local 2D spatial structure, and simple summation-based fusion of scan directions cannot adapt well to diverse object sizes, shapes, and boundaries. To address these issues, we propose \textit{Patch-MoE Mamba}, a patch-ordered mixture-of-experts state space architecture for medical image segmentation. It introduces a hierarchical patch-ordered scanning mechanism that preserves local spatial neighborhoods while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
