# Identifying novel risk factors for aneurysmal subarachnoid haemorrhage using machine learning

**Authors:** Jos P. Kanning, Junfeng Wang, Shahab Abtahi, Mirjam I. Geerlings, Ynte M. Ruigrok

PMC · DOI: 10.1038/s41598-025-88826-3 · Scientific Reports · 2025-03-18

## TL;DR

This study uses machine learning to find new risk factors for a type of stroke called aneurysmal subarachnoid haemorrhage.

## Contribution

The study introduces novel risk factors for aSAH identified through machine learning and SHAP analysis.

## Key findings

- Mean sphered cell volume and tea intake were associated with increased aSAH risk.
- Peak expiratory flow and haematocrit percentage were linked to decreased aSAH risk.
- Machine learning identified 214 variables with non-zero SHAP values for aSAH prediction.

## Abstract

Aneurysmal subarachnoid haemorrhage (aSAH) is a type of stroke with high mortality and morbidity. This study aimed to identify novel aSAH risk factors by combining machine learning (ML) and traditional statistical methods. Using the UK Biobank, we identified aSAH cases via hospital-based ICD codes and analysed 618 baseline variables covering demographics, lifestyle, medical history, and physical measurements. The CatBoost ML algorithm and Shapley Additive Explanations (SHAP) identified the top 25 variables most influential in predicting aSAH. Logistic regression further described these variables while adjusting for established aSAH risk factors. Among 501,847 participants, 893 aSAH cases were identified. ML identified 214 variables with non-zero SHAP values. Logistic regression of the top 25 variables revealed four potential novel aSAH risk factors. Increased aSAH risk was associated with mean sphered cell volume (OR 1.02, 95% CI 1.00-1.03) and tea intake (OR 1.03, 95% CI 1.01–1.05). Decreased aSAH risk was associated with peak expiratory flow (OR 0.80, 95% CI 0.66–0.96), and haematocrit percentage (OR 0.97, 95% CI 0.95-1.00). Future research should validate these findings and explore the potential non-linear relationships and interactions indicated by the ML models.

The online version contains supplementary material available at 10.1038/s41598-025-88826-3.

## Full-text entities

- **Diseases:** stroke (MESH:D020521), Aneurysmal subarachnoid haemorrhage (MESH:D013345)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11920089/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11920089/full.md

## References

7 references — full list in the complete paper: https://tomesphere.com/paper/PMC11920089/full.md

---
Source: https://tomesphere.com/paper/PMC11920089