# Assessing performance, calibration, and explainability of machine learning versus traditional models for early outcome prediction after spontaneous intracerebral hemorrhage: a systematic review and meta-analysis protocol

**Authors:** Fan Bu, Rongzhen Xu, Xinyan Zhao, Qiaoxia He, Yandi Wen, Lile Xiong, Lan Qin, Hua Guan

PMC · DOI: 10.1186/s13643-025-03059-9 · Systematic Reviews · 2026-01-10

## TL;DR

This study will compare machine learning and traditional models for predicting outcomes after brain hemorrhage, focusing on accuracy, reliability, and explainability.

## Contribution

A systematic review and meta-analysis protocol to evaluate machine learning versus traditional models for outcome prediction after ICH.

## Key findings

- The study will assess discrimination, calibration, and explainability of ML models compared to traditional models.
- It will use PRISMA-P guidelines and the PROBAST+AI tool for bias assessment.
- Findings will guide future model design and clinical application for ICH prognosis.

## Abstract

Early outcome prediction after spontaneous intracerebral hemorrhage (ICH) is critical for patient management and counseling. Although machine learning (ML) models are increasingly applied, their comparative performance and explainability relative to traditional statistical models remain unclear.

To systematically compare the predictive performance, calibration, and explainability of ML versus traditional models for early outcomes after ICH.

Following PRISMA-P guidelines and registered in PROSPERO (CRD420251166996), this systematic review and meta-analysis will include studies developing, validating, or comparing ML and traditional models for predicting early mortality or poor functional outcome (mRS ≥ 3 or GOS ≤ 3) after ICH. Data sources will include PubMed, Embase, Scopus, Web of Science, Cochrane CENTRAL, IEEE Xplore, and major Chinese databases (CNKI, Wanfang, VIP, CBM). Two reviewers will independently screen studies, extract data, and assess risk of bias using the PROBAST + AI tool, which extends and replaces the original PROBAST framework for prediction models incorporating machine learning. Pooled analyses will employ random-effects models; confidence in the body of evidence will be summarized using an adapted approach informed by GRADE principles for prognosis evidence.

This review will explore whether ML-based models demonstrate differences in discrimination, calibration, and explainability compared with traditional models.

This review will provide a comprehensive, evidence-based assessment of prognostic modeling for ICH, guiding future model design, validation, and clinical application.

PROSPERO CRD420251166996

The online version contains supplementary material available at 10.1186/s13643-025-03059-9.

## Linked entities

- **Diseases:** intracerebral hemorrhage (MONDO:0013792), ICH (MONDO:0100533)

## Full-text entities

- **Diseases:** ICH (MESH:D002543)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12882189/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12882189/full.md

## References

8 references — full list in the complete paper: https://tomesphere.com/paper/PMC12882189/full.md

---
Source: https://tomesphere.com/paper/PMC12882189