# Risk Stratification for In-Hospital Mortality in Alzheimer’s Disease Using Interpretable Regression and Explainable AI

**Authors:** Tursun Alkam, Ebrahim Tarshizi, Andrew H. Van Benschoten

PMC · DOI: 10.3390/geriatrics11020023 · 2026-02-24

## TL;DR

This study combines regression and machine learning to predict in-hospital mortality for Alzheimer’s patients and identifies key risk factors.

## Contribution

The study introduces a hybrid approach using interpretable regression and explainable AI to improve mortality risk stratification in Alzheimer’s patients.

## Key findings

- Logistic regression and XGBoost models achieved similar performance in predicting mortality, with XGBoost slightly outperforming.
- Palliative care, acute respiratory failure, DNR status, and sepsis were the strongest predictors of in-hospital mortality.
- SHAP analysis revealed additional actionable risk factors like dysphagia and malnutrition when end-of-life indicators were excluded.

## Abstract

Background: Older adults with Alzheimer’s disease (AD) face a heightened risk of adverse hospital outcomes, including mortality. However, early identification of high-risk patients remains a challenge. While regression models provide interpretable associations, they may miss non-linear interactions that machine learning can uncover. Objective: To identify key predictors of in-hospital mortality among AD patients using both survey-weighted logistic regression and explainable machine learning. Methods: We analyzed hospitalizations among AD patients aged ≥60 in the 2017 Nationwide Inpatient Sample (NIS). The outcome was in-hospital death. Predictors included demographics, hospital variables, and 15 comorbidities. Logistic regression used survey weighting to generate nationally representative inference; XGBoost incorporated NIS discharge weights as sample weights during 5-fold hospital-grouped cross-validation and used the same weights in performance evaluation. Missing-value imputation and feature scaling were performed within the cross-validation pipelines to prevent data leakage. Model performance was assessed using AUROC, AUPRC, Brier score, and log loss. Feature importance was assessed using adjusted odds ratios and SHapley Additive exPlanations (SHAP). A sensitivity analysis excluded palliative care and DNR status and was re-evaluated under the same grouped cross-validation. Results: In the full model, logistic regression achieved AUROC 0.879 and AUPRC 0.310, while XGBoost achieved AUROC 0.887 and AUPRC 0.324. Palliative care (aOR 6.19), acute respiratory failure (aOR 5.15), DNR status (aOR 2.20), and sepsis (aOR 2.26) were the strongest logistic predictors. SHAP analysis corroborated these findings and additionally emphasized dysphagia, malnutrition, and pressure ulcers. In sensitivity analysis excluding palliative care and DNR status, logistic regression performance declined (AUROC 0.806; AUPRC 0.206), while XGBoost performed similarly (AUROC 0.811; AUPRC 0.206). SHAP corroborated the dominant signals from end-of-life documentation and acute organ failure in the full model; in the restricted model (excluding DNR and palliative care), SHAP highlighted physiologic and frailty-related features (e.g., dysphagia, malnutrition, aspiration risk) that may be more actionable when end-of-life documentation is absent. Conclusions: Combining regression with explainable machine learning enables robust mortality risk stratification in hospitalized AD patients. Restricted models excluding end-of-life indicators provide actionable risk signals when such documentation is absent, while the full model may better support resource allocation and goals-of-care workflows.

## Linked entities

- **Diseases:** Alzheimer’s disease (MONDO:0004975), acute respiratory failure (MONDO:0001208)

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}
- **Diseases:** death (MESH:D003643), malnutrition (MESH:D044342), Hypothyroidism (MESH:D007037), aspiration (MESH:D011015), infection (MESH:D007239), injury to (MESH:D014947), AI (MESH:C538142), neurodegenerative disorder (MESH:D019636), coronary artery disease (MESH:D003324), frailty (MESH:D000073496), cognitive decline (MESH:D003072), anemia (MESH:D000740), acute (MESH:D000208), pathology (MESH:D005598), sepsis (MESH:D018805), pressure ulcers (MESH:D003668), dementia (MESH:D003704), dehydration (MESH:D003681), cerebrovascular disease (MESH:D002561), atrial fibrillation (MESH:D001281), urinary tract infection (MESH:D014552), acute kidney injury (MESH:D058186), organ dysfunction (MESH:D009102), Dysphagia (MESH:D003680), COVID (MESH:D000086382), Acute respiratory failure (MESH:D012131), cardiometabolic multimorbidity (MESH:D024821), AD (MESH:D000544), congestive heart failure (MESH:D006333)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13010599/full.md

---
Source: https://tomesphere.com/paper/PMC13010599