# Predicting Mortality in Pulmonary Embolism: A Machine Learning Approach with External Validation in COVID-19 Patients

**Authors:** Diana Alexandra Mîțu, Alexandru Cristian Cindrea, Alexandra Maria Borita, Adina Maria Marza, Corneluța Fira-Mladinescu, Madalin-Marius Margan, Alexandra Herlo, Alina Petrica, Gabriel-Aurel Rus, Daniel-Florin Lighezan, Flavia Zara, Ovidiu Alexandru Mederle

PMC · DOI: 10.3390/medicina62020421 · 2026-02-23

## TL;DR

This study uses machine learning to predict mortality in patients with pulmonary embolism, finding that models work better in non-COVID patients and worse in those with COVID-19.

## Contribution

The study introduces machine learning models for predicting mortality in pulmonary embolism and validates their performance in both non-COVID and COVID-19 patients.

## Key findings

- Machine learning models outperformed PESI in predicting mortality for non-COVID pulmonary embolism patients.
- Model performance significantly declined when applied to patients with concomitant COVID-19.
- Sepsis, PESI class V, and biomarkers like NT-proBNP were strongly associated with mortality.

## Abstract

Background and Objectives: Pulmonary embolism (PE) is a frequent thrombotic complication associated with SARS-CoV-2 infection and is linked to significant early mortality. Accurate early risk stratification in the emergency department (ED) remains challenging, and it is unclear how well commonly used PE prognostic tools perform in patients with concomitant COVID-19. Materials and Methods: We conducted a retrospective, single-centre study including 538 consecutive patients with acute PE and with or without confirmed SARS-CoV-2 infection admitted through the ED. Univariate analysis and machine learning models were employed to assess mortality risk. Results: In univariate analysis, mortality was strongly associated with sepsis (OR 11.68) and PESI class V (OR 5.56) and was also linked to higher neutrophil count (OR 1.19), platelet count (OR 1.12), and NT-proBNP (OR 1.20). In the non-COVID cohort, XGBoost and RF showed better discrimination than PESI class (AUC 0.864 and 0.834 vs. 0.725), while Support Vector Machines (SVM) was lower (AUC 0.740). On COVID-19 external validation, discrimination decreased: XGBoost AUC was 0.635, RF 0.614, PESI 0.584, and SVM showed no discrimination. Conclusions: ML models using routinely available ED variables improved in-hospital mortality prediction compared with PESI in non-COVID PE, but performance declined in COVID-19 patients, suggesting limited generalizability and the need for COVID-specific refinement and prospective multicenter validation.

## Linked entities

- **Diseases:** pulmonary embolism (MONDO:0005279), COVID-19 (MONDO:0100096)

## Full-text entities

- **Genes:** GGTLC5P (gamma-glutamyltransferase light chain 5 pseudogene) [NCBI Gene 653590] {aka GGT}, GPT (glutamic--pyruvic transaminase) [NCBI Gene 2875] {aka AAT1, ALT, ALT1, GPT1, SGPT}, MPO (myeloperoxidase) [NCBI Gene 4353], ELANE (elastase, neutrophil expressed) [NCBI Gene 1991] {aka ELA2, GE, HLE, HNE, NE, PMN-E}, GGT1 (gamma-glutamyltransferase 1) [NCBI Gene 2678] {aka CD224, D22S672, D22S732, GGT, GGT 1, GGTD}
- **Diseases:** injury to (MESH:D014947), inflammation (MESH:D007249), cardiopulmonary disease (MESH:D006323), DM (MESH:D009223), alveolar damage (MESH:D055370), dyspnea (MESH:D004417), Diabetes Mellitus (MESH:D003920), cardiac strain (MESH:D013180), obesity (MESH:D009765), inflammatory dysregulation (MESH:D021081), fever (MESH:D005334), emboli (MESH:D020766), PE (MESH:D011655), hypertension (MESH:D006973), Venous Thromboembolism (MESH:D054556), Mortality (MESH:D003643), microvascular injury (MESH:D017566), pulmonary thrombosis (MESH:D013927), DVT (MESH:D020246), endothelial injury (MESH:D057772), cough (MESH:D003371), right ventricular dysfunction (MESH:D018497), coagulation (MESH:D001778), infection (MESH:D007239), COVID (MESH:D000086382), thrombotic microangiopathy (MESH:D057049), thromboembolic (MESH:D013923), embolism (MESH:D004617), heart failure (MESH:D006333), Sepsis (MESH:D018805), pulmonary hypertension (MESH:D006976), Infectious Disease (MESH:D003141), septic (MESH:D001170)
- **Chemicals:** cholesterol (MESH:D002784), D (MESH:D003903)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Homo sapiens (human, species) [taxon 9606]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12943754/full.md

---
Source: https://tomesphere.com/paper/PMC12943754