ClinicalFMamba: Advancing Clinical Assessment using Mamba-based Multimodal Neuroimaging Fusion
Meng Zhou, Farzad Khalvati

TL;DR
ClinicalFMamba introduces a hybrid CNN-Mamba architecture for efficient multimodal 2D and 3D medical image fusion, enhancing diagnostic accuracy and enabling real-time clinical application.
Contribution
It presents a novel CNN-Mamba hybrid model with a tri-plane strategy for volumetric data, improving fusion quality and clinical validation over existing methods.
Findings
Superior fusion performance across multiple metrics
Achieves real-time processing suitable for clinical use
Improves downstream brain tumor classification accuracy
Abstract
Multimodal medical image fusion integrates complementary information from different imaging modalities to enhance diagnostic accuracy and treatment planning. While deep learning methods have advanced performance, existing approaches face critical limitations: Convolutional Neural Networks (CNNs) excel at local feature extraction but struggle to model global context effectively, while Transformers achieve superior long-range modeling at the cost of quadratic computational complexity, limiting clinical deployment. Recent State Space Models (SSMs) offer a promising alternative, enabling efficient long-range dependency modeling in linear time through selective scan mechanisms. Despite these advances, the extension to 3D volumetric data and the clinical validation of fused images remains underexplored. In this work, we propose ClinicalFMamba, a novel end-to-end CNN-Mamba hybrid architecture…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
