Leveraging Foundational Models and Simple Fusion for Multi-modal Physiological Signal Analysis
Youssef Ghallab, Omar Iraqy, Mohamed Kandil, Mohamed Ashraf, Saadeldine Eletter, Morougue Ghazal, Ayman Khalafallah, Nagwa El-Makky

TL;DR
This paper introduces a foundation-model-based approach for multi-modal physiological signal analysis, using simple fusion of pre-trained ECG and EEG encoders to improve emotion recognition with limited labeled data.
Contribution
It adapts the CBraMod encoder for large-scale self-supervised ECG pretraining and combines it with a pre-trained EEG encoder, enabling effective multi-modal integration with simple concatenation.
Findings
Achieves near state-of-the-art emotion recognition performance.
Demonstrates the effectiveness of foundation models in physiological signal analysis.
Shows simple embedding fusion can leverage multi-modal representations effectively.
Abstract
Physiological signals such as electrocardiograms (ECG) and electroencephalograms (EEG) provide complementary insights into human health and cognition, yet multi-modal integration is challenging due to limited multi-modal labeled data, and modality-specific differences . In this work, we adapt the CBraMod encoder for large-scale self-supervised ECG pretraining, introducing a dual-masking strategy to capture intra- and inter-lead dependencies. To overcome the above challenges, we utilize a pre-trained CBraMod encoder for EEG and pre-train a symmetric ECG encoder, equipping each modality with a rich foundational representation. These representations are then fused via simple embedding concatenation, allowing the classification head to learn cross-modal interactions, together enabling effective downstream learning despite limited multi-modal supervision. Evaluated on emotion recognition,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · EEG and Brain-Computer Interfaces · ECG Monitoring and Analysis
