Multimodal Foundation Models For Echocardiogram Interpretation
Matthew Christensen, Milos Vukadinovic, Neal Yuan, David Ouyang

TL;DR
This paper introduces EchoCLIP, a multimodal foundation model trained on extensive cardiac ultrasound data, demonstrating strong zero-shot performance in cardiac assessment and report generation, advancing automated echocardiogram interpretation.
Contribution
Developed EchoCLIP, a novel multimodal foundation model for echocardiography, leveraging large-scale cardiac ultrasound videos and expert interpretations to enable zero-shot diagnostic capabilities.
Findings
Achieved a mean absolute error of 7.1% in cardiac function assessment.
AUC of 0.84-0.98 for identifying intracardiac devices.
High accuracy in patient identification and clinical change detection.
Abstract
Multimodal deep learning foundation models can learn the relationship between images and text. In the context of medical imaging, mapping images to language concepts reflects the clinical task of diagnostic image interpretation, however current general-purpose foundation models do not perform well in this context because their training corpus have limited medical text and images. To address this challenge and account for the range of cardiac physiology, we leverage 1,032,975 cardiac ultrasound videos and corresponding expert interpretations to develop EchoCLIP, a multimodal foundation model for echocardiography. EchoCLIP displays strong zero-shot (not explicitly trained) performance in cardiac function assessment (external validation left ventricular ejection fraction mean absolute error (MAE) of 7.1%) and identification of implanted intracardiac devices (areas under the curve (AUC)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · Cardiac Valve Diseases and Treatments · Artificial Intelligence in Healthcare and Education
