Mammo-Mamba: A Hybrid State-Space and Transformer Architecture with Sequential Mixture of Experts for Multi-View Mammography

Farnoush Bayatmakou; Reza Taleei; Nicole Simone; Arash Mohammadi

arXiv:2507.17662·eess.IV·July 24, 2025

Mammo-Mamba: A Hybrid State-Space and Transformer Architecture with Sequential Mixture of Experts for Multi-View Mammography

Farnoush Bayatmakou, Reza Taleei, Nicole Simone, Arash Mohammadi

PDF

Open Access

TL;DR

Mammo-Mamba is a novel hybrid architecture combining state-space models, transformers, and mixture of experts to improve multi-view mammogram classification efficiency and accuracy.

Contribution

It introduces the SecMamba block with Sequential Mixture of Experts, enhancing high-resolution mammogram feature learning within a unified framework.

Findings

01

Achieves superior classification performance on CBIS-DDSM dataset.

02

Maintains computational efficiency compared to traditional Transformer models.

03

Effectively mitigates Transformer limitations with dynamic expert gating.

Abstract

Breast cancer (BC) remains one of the leading causes of cancer-related mortality among women, despite recent advances in Computer-Aided Diagnosis (CAD) systems. Accurate and efficient interpretation of multi-view mammograms is essential for early detection, driving a surge of interest in Artificial Intelligence (AI)-powered CAD models. While state-of-the-art multi-view mammogram classification models are largely based on Transformer architectures, their computational complexity scales quadratically with the number of image patches, highlighting the need for more efficient alternatives. To address this challenge, we propose Mammo-Mamba, a novel framework that integrates Selective State-Space Models (SSMs), transformer-based attention, and expert-driven feature refinement into a unified architecture. Mammo-Mamba extends the MambaVision backbone by introducing the Sequential Mixture of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Data Compression Techniques