# Multimodal deep learning for acute myocardial infarction detection from 12-lead electrocardiogram: a multi-centre study with cross-hospital validation

**Authors:** Vibha Gupta, Lukas Hilgendorf, Erik Andersson, Antros Louca, Arman Shahmari, Alfred Hjalmarsson, Rajkumar Saini, Carlo Pirazzi, Monér Alchay, Araz Rawshani

PMC · DOI: 10.1093/ehjdh/ztaf125 · 2025-10-27

## TL;DR

A deep learning model was developed to detect heart attacks from ECGs and showed strong performance across multiple hospitals.

## Contribution

A novel multimodal deep learning model (Conv-BiLSTM-Attn) for AMI detection with cross-hospital validation.

## Key findings

- The model achieved an AUROC of 0.848 and AUPRC of 0.456 under cross-hospital validation.
- Sensitivity and specificity ranged from 0.671–0.776 and 0.651–0.801 across hospitals.
- The model outperformed benchmark models and highlighted relevant ECG segments using Score-CAM.

## Abstract

Acute myocardial infarction (AMI) remains a leading global cause of mortality, where timely diagnosis is critical to enable early intervention. The 12-lead electrocardiogram (ECG) is a critical tool for AMI detection. While deep learning (DL) models show promise for automated ECG analysis, most prior studies rely on small, curated datasets with limited external validation, limiting their clinical applicability.

We developed a multimodal DL model (Conv-BiLSTM-Attn) integrating convolutional and recurrent neural networks with an attention mechanism. Using a large, multi-centre dataset of 145 656 ECGs from 96 813 patients across three Swedish hospitals. We trained the model to detect AMI using raw 12-lead ECG signals and demographic inputs (age, sex). Model performance was evaluated under two external validation protocols: generalization across hospitals (GAH) and leave-one-hospital-out (LOHO). The model achieved an area under the receiver operating characteristic (AUROC) of 0.848 (95% CI: 0.84–0.86) and an area under the precision-recall (AUPRC) of 0.456, reflecting class imbalance (∼6–10% AMI prevalence) under the GAH protocol. Subgroup AUROCs ranged from 0.79 to 0.92 across age and sex groups. At the Youden-optimized threshold (0.439), the model showed sensitivity of 0.736, specificity of 0.793, negative predictive value of 0.976, and weighted F1-score of 0.837. Under the LOHO protocol, AUROCs ranged from 0.801–0.849. At Youden thresholds (0.34–0.50), sensitivity ranged from 0.671 to 0.776 and specificity from 0.651 to 0.801, confirming generalizability across sites. Conv-BiLSTM-Attn outperformed benchmark models, with Score-CAM highlighting relevant ST–T segments.

This DL model can support accurate and generalizable AMI detection from routine ECGs, with the Conv-BiLSTM-Attn architecture outperforming current benchmark approaches.

Graphical Abstract

## Linked entities

- **Diseases:** acute myocardial infarction (MONDO:0004781)

## Full-text entities

- **Diseases:** AMI (MESH:D009203), ST-T (MESH:D001260)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12853118/full.md

---
Source: https://tomesphere.com/paper/PMC12853118