LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation
Trung Dinh Quoc Dang, Huy Hoang Nguyen, Aleksei Tiulpin

TL;DR
LoG-VMamba introduces a novel local-global vision Mamba model that efficiently captures local and global dependencies in 2D and 3D medical images, outperforming CNNs and Transformers in segmentation tasks.
Contribution
The paper presents LoG-VMamba, a new SSM-based model that explicitly enforces local and global token dependencies with simple scanning, improving efficiency and accuracy in medical image segmentation.
Findings
Outperforms CNN and Transformer baselines on diverse MIS tasks
Achieves global receptive fields with linear complexity in tokens
Efficiently models local-global dependencies in high-dimensional images
Abstract
Mamba, a State Space Model (SSM), has recently shown competitive performance to Convolutional Neural Networks (CNNs) and Transformers in Natural Language Processing and general sequence modeling. Various attempts have been made to adapt Mamba to Computer Vision tasks, including medical image segmentation (MIS). Vision Mamba (VM)-based networks are particularly attractive due to their ability to achieve global receptive fields, similar to Vision Transformers, while also maintaining linear complexity in the number of tokens. However, the existing VM models still struggle to maintain both spatially local and global dependencies of tokens in high dimensional arrays due to their sequential nature. Employing multiple and/or complicated scanning strategies is computationally costly, which hinders applications of SSMs to high-dimensional 2D and 3D images that are common in MIS problems. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques
MethodsSparse Evolutionary Training · Mamba: Linear-Time Sequence Modeling with Selective State Spaces
