# To develop a machine learning-based model for predicting the risk of gastrointestinal bleeding in patients with spontaneous intracerebral hemorrhage

**Authors:** Chenzhu Cai, Jiayin Wang, Mingfa Cai, Zhen Qi, Xieli Guo

PMC · DOI: 10.3389/fneur.2025.1690638 · Frontiers in Neurology · 2026-01-02

## TL;DR

This study creates a machine learning model to predict gastrointestinal bleeding risk in patients with brain hemorrhage, helping doctors identify high-risk patients early.

## Contribution

A novel machine learning model for predicting gastrointestinal bleeding in spontaneous intracerebral hemorrhage patients is developed and validated.

## Key findings

- The model achieved an AUC of 0.803 in internal validation and 0.757 in external validation.
- Key predictors include GCS score, intraventricular hemorrhage, surgeries, albumin, and distance to the midline.
- Calibration curves and decision curve analysis confirmed the model's reliability and clinical utility.

## Abstract

Spontaneous intracerebral hemorrhage (sICH) is a critical illness with a poor clinical prognosis, and gastrointestinal bleeding (GIB) is a severe complication that can significantly worsen the patient’s adverse outcomes. However, research on the risk factors for GIB in sICH patients is currently limited. Therefore, this study aims to construct and validate a predictive model for GIB risk in sICH patients using machine learning methods, providing decision support for the early identification of high-risk patients in clinical settings.

The present study retrospectively analysed the clinical data of 738 patients with sICH from two centres. In the feature selection process, the Boruta algorithm was initially employed for preliminary screening, and subsequently, the Information-Gain method was utilised to identify significant predictors. Following this, Spearman correlation analysis was implemented to eliminate collinearity between variables. During the model construction stage, the machine learning algorithm was optimized based on the internal test set, and the model performance was finally verified by the internal test set and the external validation set. In order to enhance the interpretability of the model, the SHapley Additive exPlanations (SHAP) method was used to visualize the prediction results.

The Glasgow Coma Scale (GCS) score, intraventricular extension of hemorrhage (ICH with IVH), surgeries, albumin, and distance to the midline were identified as significant predictors of GIB in patients with sICH. The patients were randomly divided into training and validation cohorts in an 8:2 ratio for model development and validation. An Extra Trees Classifier algorithm was used to construct the predictive model. Internal validation showed that the area under the receiver operating characteristic (ROC) curve (AUC) was 0.803 (95% CI: 0.659–0.947), while the AUC for external validation data was 0.757 (95% CI: 0.675–0.839). The calibration curves for both internal and external validation were close to the ideal diagonal line, and decision curve analysis (DCA) demonstrated that the model provided a substantial net benefit.

Our prediction model for GIB in sICH patients has reliable predictive power and provides a reliable tool for clinicians to identify early the high-risk group for GIB in sICH patients.

## Full-text entities

- **Genes:** ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}
- **Diseases:** GIB (MESH:D006471), ICH (MESH:D002543), hemorrhage (MESH:D006470), Coma (MESH:D003128)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12807890/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12807890/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC12807890/full.md

---
Source: https://tomesphere.com/paper/PMC12807890