CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement
Zheng Wu, Yiping Xie, Bo Zhao, Jiguang He, Fei Luo, Ning Deng, Zitong, Yu

TL;DR
CardiacMamba is a multimodal RGB-RF fusion framework that enhances remote heart rate measurement by integrating novel modules for dynamic RF analysis, cross-modal alignment, and frequency domain refinement, achieving state-of-the-art accuracy and robustness.
Contribution
It introduces a novel multimodal fusion framework with specialized modules for improved heart rate estimation and fairness across diverse populations.
Findings
Achieves state-of-the-art accuracy on the EquiPleth dataset.
Reduces skin tone bias and improves robustness under missing-modality scenarios.
Enhances periodicity detection and generalization in remote physiological measurement.
Abstract
Heart rate (HR) estimation via remote photoplethysmography (rPPG) offers a non-invasive solution for health monitoring. However, traditional single-modality approaches (RGB or Radio Frequency (RF)) face challenges in balancing robustness and accuracy due to lighting variations, motion artifacts, and skin tone bias. In this paper, we propose CardiacMamba, a multimodal RGB-RF fusion framework that leverages the complementary strengths of both modalities. It introduces the Temporal Difference Mamba Module (TDMM) to capture dynamic changes in RF signals using timing differences between frames, enhancing the extraction of local and global features. Additionally, CardiacMamba employs a Bidirectional SSM for cross-modal alignment and a Channel-wise Fast Fourier Transform (CFFT) to effectively capture and refine the frequency domain characteristics of RGB and RF signals, ultimately improving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNon-Invasive Vital Sign Monitoring · ECG Monitoring and Analysis · Wireless Body Area Networks
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces
