De-biased Multimodal Electrocardiogram Analysis
Haitao Li, Ziyu Li, Yiheng Mao, Ziyi Liu, Zhoujian Sun, Zhengxing, Huang

TL;DR
This paper introduces a de-biased multimodal ECG analysis method that enhances ECG understanding in large language models by directly integrating ECG embeddings, addressing confounder bias, and improving robustness and zero-shot performance.
Contribution
It proposes a novel approach to directly incorporate ECG embeddings into LLMs and a de-bias pre-training method to mitigate confounder effects, improving ECG reasoning capabilities.
Findings
Model performs well on ECG-QA tasks under adversarial tests.
Effective in zero-shot ECG understanding.
Reduces reliance on spurious correlations.
Abstract
Multimodal large language models (MLLMs) are increasingly being applied in the medical field, particularly in medical imaging. However, developing MLLMs for ECG signals, which are crucial in clinical settings, has been a significant challenge beyond medical imaging. Previous studies have attempted to address this by converting ECGs into several text tags using an external classifier in a training-free manner. However, this approach significantly compresses the information in ECGs and underutilizes the reasoning capabilities of LLMs. In this work, we directly feed the embeddings of ECGs into the LLM through a projection layer, retaining more information about ECGs and better leveraging the reasoning abilities of LLMs. Our method can also effectively handle a common situation in clinical practice where it is necessary to compare two ECGs taken at different times. Recent studies found that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsECG Monitoring and Analysis
