# Cardiac magnetic resonance imaging-large language model Meta AI: a finetuned large language model for identifying findings and associated attributes in cardiac magnetic resonance imaging reports

**Authors:** Michelle Z. Fang, Makiya Nakashima, Kailash Singh, Eileen Galvani, Xiaotan Sun, Sharmeen Sorathia, Kevin Dorocak, Deborah Kwon, Christopher Nguyen, David Chen

PMC · DOI: 10.1016/j.jocmr.2025.101968 · 2025-11-13

## TL;DR

This paper introduces a fine-tuned large language model for automatically extracting cardiovascular findings and attributes from cardiac MRI reports, improving clinical data processing.

## Contribution

A novel fine-tuned LLaMA model (CMR-LLaMA) that extracts 34 cardiovascular conditions and their attributes from CMR reports with high accuracy.

## Key findings

- The model achieved an average F1 score of 0.85 for identifying cardiovascular conditions in CMR reports.
- It demonstrated strong performance in extracting attributes like certainty and severity with average F1 scores of 0.97.
- The model showed moderate accuracy in external validation with an average F1 score of 0.78 for condition mentions.

## Abstract

Cardiac magnetic resonance imaging (CMR) studies contain a wealth of information on a patient’s cardiovascular status. The ability to extract this data from free-text reports could serve to automate clinical decision support tools and generate data for retrospective clinical knowledge discovery, and clinical operational purposes. Few studies have examined the automatic extraction of data from free-text CMR reports, and the existing studies that do have key limitations, including small sample size and disease-specific data extraction. Existing studies also fail to extract features associated with the cardiovascular conditions that reflect nuances in natural language, such as uncertainty, severity, subtype, and anatomical locations of the condition. The goal of this study was to build a broad named entity recognition model to automatically extract a broad variety of common CMR findings and their associated attributes from CMR reports.

We fine-tuned a Large Language Model Meta AI (LLaMA) model trained to identify 34 cardiovascular conditions and their associated attributes, including certainty, severity, location, and subtype of the condition. This model was trained on 1778 MRI reports and tested on 397 reports in an held-out test set and another 428 reports from another site in our hospital system with independent radiology practice and scanners.

Our model shows robust performance in predicting the mention of the 31 cardiovascular conditions (average F1 = 0.85). It also showed strong performance predicting attributes, including certainty (average F1 = 0.97) and severity (average F1 = 0.97). Model performance on the external validation set was generally slightly lower than the internal validation set, but performance was still strong (average F1 = 0.78 for mention, 0.97 for certainty, and 0.96 for severity).

CMR-LLaMA has strong performance identifying a variety of concept mentions and moderate accuracies in extracting a selection of other associated attributes. NLP models can be used to automate the extraction of data from CMR reports to potentially assist with clinical and research workflow.

ga1

## Full-text entities

- **Diseases:** cardiovascular conditions (MESH:D002318), CMR-LLaMA (MESH:C564543)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12766592/full.md

---
Source: https://tomesphere.com/paper/PMC12766592