Incentivizing Cardiologist-Like Reasoning in MLLMs for Interpretable Echocardiographic Diagnosis
Yi Qin, Lehan Wang, Chenxu Zhao, Alex P.W. Lee, Xiaomeng Li

TL;DR
This paper introduces CardiacMind and CRT to improve multimodal large language models' ability to reason like cardiologists in echocardiographic diagnosis, achieving significant accuracy gains and clinician agreement.
Contribution
It proposes a novel cardiologist-like reasoning framework with reinforcement learning rewards to enhance MLLMs' interpretability and diagnostic accuracy in echocardiography.
Findings
48% improvement in multiview diagnosis accuracy
5% improvement on CardiacNet-PAH
93.33% clinician agreement with reasoning outputs
Abstract
Echocardiographic diagnosis is vital for cardiac screening yet remains challenging. Existing echocardiography foundation models do not effectively capture the relationships between quantitative measurements and clinical manifestations, whereas medical reasoning multimodal large language models (MLLMs) require costly construction of detailed reasoning paths and remain ineffective at directly incorporating such echocardiographic priors into their reasoning. To address these limitations, we propose a novel approach comprising Cardiac Reasoning Template (CRT) and CardiacMind to enhance MLLM's echocardiographic reasoning by introducing cardiologist-like mindset. Specifically, CRT provides stepwise canonical diagnostic procedures for complex cardiac diseases to streamline reasoning path construction without the need for costly case-by-case verification. To incentivize reasoning MLLM under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsECG Monitoring and Analysis · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)
