A Language-Signal-Vision Multimodal Framework for Multitask Cardiac Analysis

Yuting Zhang; Tiantian Geng; Luoying Hao; Xinxing Cheng; Alexander Thorley; Xiaoxia Wang; Wenqi Lu; Sandeep S Hothi; Lei Wei; Zhaowen Qiu; Dipak Kotecha; and Jinming Duan

arXiv:2508.13072·cs.AI·August 19, 2025

A Language-Signal-Vision Multimodal Framework for Multitask Cardiac Analysis

Yuting Zhang, Tiantian Geng, Luoying Hao, Xinxing Cheng, Alexander Thorley, Xiaoxia Wang, Wenqi Lu, Sandeep S Hothi, Lei Wei, Zhaowen Qiu, Dipak Kotecha, and Jinming Duan

PDF

TL;DR

This paper introduces TGMM, a comprehensive multimodal framework integrating laboratory tests, ECGs, and echocardiograms, to improve multitask cardiac analysis through dynamic data fusion and textual guidance, outperforming existing methods.

Contribution

The study presents a novel unified multimodal framework with a MedFlexFusion module and textual guidance, addressing data scarcity and modality integration challenges in cardiac analysis.

Findings

01

TGMM outperforms state-of-the-art methods across multiple tasks.

02

The framework demonstrates robustness on a public dataset.

03

Key modality features synergistically enhance clinical decision-making.

Abstract

Contemporary cardiovascular management involves complex consideration and integration of multimodal cardiac datasets, where each modality provides distinct but complementary physiological characteristics. While the effective integration of multiple modalities could yield a holistic clinical profile that accurately models the true clinical situation with respect to data modalities and their relatives weightings, current methodologies remain limited by: 1) the scarcity of patient- and time-aligned multimodal data; 2) reliance on isolated single-modality or rigid multimodal input combinations; 3) alignment strategies that prioritize cross-modal similarity over complementarity; and 4) a narrow single-task focus. In response to these limitations, a comprehensive multimodal dataset was curated for immediate application, integrating laboratory test results, electrocardiograms, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.