FaRMamba: Frequency-based learning and Reconstruction aided Mamba for Medical Segmentation
Ze Rong, ZiYue Zhao, Zhaoxin Wang, Lei Ma

TL;DR
FaRMamba introduces a frequency-aware extension to the Mamba model, improving medical image segmentation by restoring high-frequency details and spatial correlations, leading to better boundary accuracy and detail preservation.
Contribution
It proposes two modules, MSFM and SSRAE, to explicitly address high-frequency information loss and spatial structure degradation in Mamba-based segmentation models.
Findings
Outperforms existing CNN-Transformer hybrid models.
Enhances boundary accuracy and detail preservation.
Demonstrates effectiveness across multiple medical imaging datasets.
Abstract
Accurate medical image segmentation remains challenging due to blurred lesion boundaries (LBA), loss of high-frequency details (LHD), and difficulty in modeling long-range anatomical structures (DC-LRSS). Vision Mamba employs one-dimensional causal state-space recurrence to efficiently model global dependencies, thereby substantially mitigating DC-LRSS. However, its patch tokenization and 1D serialization disrupt local pixel adjacency and impose a low-pass filtering effect, resulting in Local High-frequency Information Capture Deficiency (LHICD) and two-dimensional Spatial Structure Degradation (2D-SSD), which in turn exacerbate LBA and LHD. In this work, we propose FaRMamba, a novel extension that explicitly addresses LHICD and 2D-SSD through two complementary modules. A Multi-Scale Frequency Transform Module (MSFM) restores attenuated high-frequency cues by isolating and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
