TL;DR
MambaBack is a hybrid architecture for Whole Slide Image analysis that preserves spatial locality, enhances local and global feature extraction, and reduces memory usage during inference.
Contribution
It introduces a novel hybrid model combining Mamba and MambaOut with Hilbert sampling and hierarchical structures for improved WSI analysis.
Findings
Outperforms seven state-of-the-art methods on five datasets.
Effectively preserves 2D spatial locality with Hilbert sampling.
Reduces peak memory usage during inference with asymmetric chunking.
Abstract
Whole Slide Image (WSI) analysis is pivotal in computational pathology, enabling cancer diagnosis by integrating morphological and architectural cues across magnifications. Multiple Instance Learning (MIL) serves as the standard framework for WSI analysis. Recently, Mamba has become a promising backbone for MIL, overtaking Transformers due to its efficiency and global context modeling capabilities originating from Natural Language Processing (NLP). However, existing Mamba-based MIL approaches face three critical challenges: (1) disruption of 2D spatial locality during 1D sequence flattening; (2) sub-optimal modeling of fine-grained local cellular structures; and (3) high memory peaks during inference on resource-constrained edge devices. Studies like MambaOut reveal that Mamba's SSM component is redundant for local feature extraction, where Gated CNNs suffice. Recognizing that WSI…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
