A SHAP-based explainable multi-level stacking ensemble learning method for predicting the length of stay in acute stroke
Zhenran Xu

TL;DR
This study develops an explainable multi-level stacking ensemble model using SHAP for predicting prolonged length of stay in acute stroke patients, improving performance and interpretability over traditional models.
Contribution
It introduces a novel SHAP-based explainable ensemble approach tailored for stroke LOS prediction, incorporating system-level factors and feature selection.
Findings
Ensemble model outperformed logistic regression in ischemic stroke prediction.
SHAP analysis identified key predictors like rehabilitation and care involvement.
Model achieved high discrimination with AUC of 0.824 for ischemic stroke.
Abstract
Length of stay (LOS) prediction in acute stroke is critical for improving care planning. Existing machine learning models have shown suboptimal predictive performance, limited generalisability, and have overlooked system-level factors. We aimed to enhance model efficiency, performance, and interpretability by refining predictors and developing an interpretable multi-level stacking ensemble model. Data were accessed from the biennial Stroke Foundation Acute Audit (2015, 2017, 2019, 2021) in Australia. Models were developed for ischaemic and haemorrhagic stroke separately. The outcome was prolonged LOS (the LOS above the 75th percentile). Candidate predictors (ischaemic: n=89; haemorrhagic: n=83) were categorised into patient, clinical, and system domains. Feature selection with correlation-based approaches was used to refine key predictors. The evaluation of models included…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Acute Ischemic Stroke Management · Artificial Intelligence in Healthcare
MethodsShapley Additive Explanations · Logistic Regression · Feature Selection
