Patch-MoE Mamba: A Patch-Ordered Mixture-of-Experts State Space Architecture for Medical Image Segmentation

Diego Adame; Fabian Vazquez; Jose A. Nunez; Huimin Li; Jinghao Yang; Erik Enriquez; DongChul Kim; Haoteng Tang; Bin Fu; Pengfei Gu

arXiv:2605.17719·cs.CV·May 19, 2026

Patch-MoE Mamba: A Patch-Ordered Mixture-of-Experts State Space Architecture for Medical Image Segmentation

Diego Adame, Fabian Vazquez, Jose A. Nunez, Huimin Li, Jinghao Yang, Erik Enriquez, DongChul Kim, Haoteng Tang, Bin Fu, Pengfei Gu

PDF

TL;DR

Patch-MoE Mamba introduces a hierarchical patch-ordered scanning and adaptive fusion mechanism in a state space architecture, improving medical image segmentation by capturing multi-scale context and local spatial structure.

Contribution

It proposes a novel patch-ordered mixture-of-experts state space architecture that addresses limitations of existing models in preserving spatial structure and adaptive feature fusion.

Findings

01

Outperforms existing methods on five polyp segmentation benchmarks.

02

Achieves superior results on ISIC 2017/2018 skin lesion datasets.

03

Demonstrates generality and effectiveness across multiple medical imaging tasks.

Abstract

CNN- and Transformer-based architectures have achieved strong performance in medical image segmentation, but CNNs are limited in modeling long-range dependencies, while Transformers often suffer from quadratic computational and memory complexity. State space models, especially Mamba-based networks, offer an efficient alternative with linear sequence complexity. However, existing Mamba segmentation models still face two limitations: pixel-wise directional scanning can disrupt local 2D spatial structure, and simple summation-based fusion of scan directions cannot adapt well to diverse object sizes, shapes, and boundaries. To address these issues, we propose \textit{Patch-MoE Mamba}, a patch-ordered mixture-of-experts state space architecture for medical image segmentation. It introduces a hierarchical patch-ordered scanning mechanism that preserves local spatial neighborhoods while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.