# Diagnosis model for assessing chronic thromboembolic pulmonary hypertension in high-altitude pulmonary embolism patients: a machine learning approach

**Authors:** Entong Fan, Jiangping Ma, Yanjun Zhang, Boning Yang, Gulinigeer Zhakeer, Yini Huang, Qing Yu, Yanxi Zeng, Ma Mi

PMC · DOI: 10.3389/fmed.2025.1666574 · Frontiers in Medicine · 2025-10-07

## TL;DR

This study uses machine learning to develop a model for early detection of chronic thromboembolic pulmonary hypertension in patients with pulmonary embolism at high altitude.

## Contribution

A novel machine learning model using clinical data to predict CTEPH in high-altitude PE patients is proposed.

## Key findings

- The Random Forest model achieved an AUC of 0.842 in predicting CTEPH.
- Enlarged right heart structures and superior pulmonary embolism increased CTEPH risk.
- Normal right heart and isolated inferior embolism reduced CTEPH risk.

## Abstract

Patients with pulmonary embolism (PE) at high altitude face an increased risk of developing chronic thromboembolic pulmonary hypertension (CTEPH). This study aims to establish a diagnosis model of CTEPH patients at high altitude to optimize early screening.

A retrospective cohort of CTEPH and PE patients was rigorously selected through inclusion/exclusion criteria. Clinical data encompassing biochemical profiles, echocardiography, and CT angiography (CTA) were collected, yielding 103 candidate variables. Feature parameters were screened using the Boruta algorithm, followed by predictive model development with seven machine learning architectures. The optimal model was identified based on area under the curve (AUC). The optimal Random Forest model was subsequently interpreted through Shapley Additive Explanations (SHAP) to quantify feature contributions.

Among 57 PE patients, 44% met echocardiographic criteria for pulmonary hypertension following PE. Diameter of right atrium, diameter of right ventricle, Vessel-Grade (of embolization) and Sup-inferior (superior or inferior of embolization) were key identified predictors. Random Forests model had the highest AUC of 0.842. Enlarged right heart, embolization of small vessels and superior pulmonary artery embolism increased the risk of CTEPH, while normal right heart structure and isolated inferior pulmonary embolism reduced it.

The Random Forests model demonstrated potential for detecting CTEPH in PE patients, enabling early and rapid pulmonary hypertension assessment.

## Linked entities

- **Diseases:** pulmonary embolism (MONDO:0005279), chronic thromboembolic pulmonary hypertension (MONDO:0013024), pulmonary hypertension (MONDO:0005149)

## Full-text entities

- **Diseases:** CTEPH (MESH:D011655), pulmonary hypertension (MESH:D006976)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12537356/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12537356/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/PMC12537356/full.md

---
Source: https://tomesphere.com/paper/PMC12537356