Rethinking Multimodality: Optimizing Multimodal Deep Learning for Biomedical Signal Classification
Timothy Oladunni, Alex Wong

TL;DR
This paper critically examines multimodal deep learning for biomedical signals, revealing that combining multiple modalities doesn't always improve performance and emphasizing the importance of selecting complementary feature domains.
Contribution
It introduces a theoretical framework for identifying optimal domain combinations based on information-theoretic complementarity, challenging the assumption that more modalities always enhance accuracy.
Findings
Hybrid 1 outperforms single-modality models in ECG classification.
Adding frequency domain data sometimes reduces performance due to redundancy.
Theoretical framework quantifies ideal domain combinations based on complementarity.
Abstract
This study proposes a novel perspective on multimodal deep learning for biomedical signal classification, systematically analyzing how complementary feature domains impact model performance. While fusing multiple domains often presumes enhanced accuracy, this work demonstrates that adding modalities can yield diminishing returns, as not all fusions are inherently advantageous. To validate this, five deep learning models were designed, developed, and rigorously evaluated: three unimodal (1D-CNN for time, 2D-CNN for time-frequency, and 1D-CNN-Transformer for frequency) and two multimodal (Hybrid 1, which fuses 1D-CNN and 2D-CNN; Hybrid 2, which combines 1D-CNN, 2D-CNN, and a Transformer). For ECG classification, bootstrapping and Bayesian inference revealed that Hybrid 1 consistently outperformed the 2D-CNN baseline across all metrics (p-values < 0.05, Bayesian probabilities > 0.90),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsECG Monitoring and Analysis · EEG and Brain-Computer Interfaces · Cardiac electrophysiology and arrhythmias
