# Prognostic value of machine learning for brain computed tomography as a predictor of neurologic outcomes after cardiac arrest: a systematic review and meta-analysis

**Authors:** Kyung Hun Yoo, Juncheol Lee, Wonhee Kim, Bitnarae Kim, Elleah Jueun Chin, Jae-Guk Kim, Hyun-Young Choi, Jaehoon Oh

PMC · DOI: 10.1186/s13049-026-01565-w · Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine · 2026-01-30

## TL;DR

This study compares machine learning with manual and automatic methods for predicting neurological outcomes after cardiac arrest using brain CT scans.

## Contribution

The study provides the first meta-analysis comparing machine learning's diagnostic accuracy against manual and automatic GWR measurements in post-cardiac arrest patients.

## Key findings

- Machine learning showed higher diagnostic accuracy than manual GWR measurement.
- Machine learning was statistically equivalent to automatic GWR measurement.
- Machine learning may be a valuable tool for predicting poor neurological outcomes after cardiac arrest.

## Abstract

The gray-to-white matter ratio (GWR) on brain computed tomography (CT) is used to predict neurological outcomes after cardiac arrest. Even though automated methods, such as automatic GWR and machine learning, have been compared to manual GWR, the superiority remains unknown. Therefore, we conducted a systematic review and meta-analysis to compare the diagnostic accuracy of these three CT-based methods.

We systematically searched MEDLINE, EMBASE, Web of Science, Scopus, and IEEE Xplore and included studies evaluating neurological outcomes using the Cerebral Performance Category Scale. We performed a subgroup analysis to compare machine learning with manual or automatic GWR measurements. The Prediction model Risk of Bias ASsessment Tool was used to assess the risk of bias and applicability. Diagnostic accuracy for predicting poor neurological outcomes was evaluated using the pooled diagnostic odds ratio (DOR) and pooled area under the curve (AUC).

In total, 1594 patients from seven observational studies were included. Machine learning showed significantly higher diagnostic accuracy (pooled AUC, 0.813; pooled DOR, 14.02; 95% confidence interval [CI], 6.51–30.18; I2 = 63.1%) than manual GWR measurement (pooled AUC, 0.755; pooled DOR, 5.16; 95% CI, 3.75–7.08, I2 = 0%; p = 0.02). Machine learning showed statistically equivalent diagnostic accuracy, although it was numerically lower than automatic GWR measurement (pooled AUC, 0.832; pooled DOR, 11.92, 95% CI, 7.55–18.82; I2 = 24.3%; p = 0.72) for predicting poor neurological outcomes.

Machine learning in brain CT may have significant diagnostic value for predicting poor neurological outcomes in post-cardiac arrest patients. Machine learning may be comparable to automatic GWR measurement and outperform manual GWR measurement in terms of diagnostic accuracy.

The online version contains supplementary material available at 10.1186/s13049-026-01565-w.

## Linked entities

- **Diseases:** cardiac arrest (MONDO:0000745)

## Full-text entities

- **Diseases:** cardiac arrest (MESH:D006323)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12931003/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12931003/full.md

## References

1 references — full list in the complete paper: https://tomesphere.com/paper/PMC12931003/full.md

---
Source: https://tomesphere.com/paper/PMC12931003