# An interpretable machine-learning model for predicting in-hospital mortality in patients with sepsis-associated acute kidney injury

**Authors:** Jian-Zhong Wang, Nan Zhang, Rong-Rong Ma, Mei Yang, Yao-Geng Chen, Wen-Jie Zhou

PMC · DOI: 10.3389/fmed.2026.1756831 · Frontiers in Medicine · 2026-02-13

## TL;DR

This study develops and validates an interpretable machine learning model to predict in-hospital mortality in patients with sepsis-associated acute kidney injury.

## Contribution

The study introduces an interpretable ML model for predicting mortality in sepsis-associated acute kidney injury patients using clinical data and SHAP for interpretability.

## Key findings

- XGBoost showed superior performance with an AUC of 0.8799 in predicting in-hospital mortality.
- SHAP analysis identified SAPS II score, AKI stage, and biochemical markers as key predictors of mortality.
- The model was externally validated, showing robustness and potential for clinical use.

## Abstract

Sepsis-associated acute kidney injury (S-AKI) is a severe complication in critically ill patients, linked to increased short-term mortality and chronic kidney disease. Existing prognostic tools like SOFA and SAPS II lack full representation of intricate clinical variable interactions. Machine learning (ML) models have potential in intensive care, but few validated and interpretable models focus on the in-hospital mortality rate of patients with S-AKI. This study aims to create and validate ML models for forecasting in-hospital mortality in S-AKI patients, identifying the most effective predictive model.

We conducted a retrospective analysis of data from the MIMIC-IV 3.0 database to identify adult ICU patients who met theSepsis-3.0(Sepsis-3 was defined as suspected infection with an acute increase in SOFA score ≥2) and KDIGO criteria for S-AKI. Additionally, a prospective cohort study from the General Hospital of Ningxia Medical University spanning 2023 to 2025 was included. Predictors recorded within 24 h of ICU admission included demographic information, comorbidities, vital signs, laboratory results, treatments, and severity scores. Variables with more than 20% missing data were excluded, and the remaining data were processed using interpolation. Feature selection was performed using the Boruta algorithm, and five machine learning models were trained (XGBoost, Random Forest, LightGBM, Decision tree, logistic regression). Model performance evaluation was based on metrics such as AUC, accuracy, sensitivity, specificity, F1 score, and clinical efficacy assessed through decision curve analysis. To enhance model interpretability, the SHapley Additive exPlanations (SHAP) method was employed.

Among 16,800 patients with S-AKI, non-survivors(in-hospital mortality) exhibited older age, higher disease severity scores, more pronounced fluid overload, poorer renal function, metabolic acidosis, coagulation disorders, and heightened inflammatory responses. The XGBoost model demonstrated superior discriminative power (AUC 0.8799 置信区间) in internal validation, surpassing other ML models, with exceptional sensitivity, accuracy, and F1 score. Decision curve analysis revealed that LightGBM offered the most significant net clinical benefit across various threshold probabilities. SHAP analysis consistently identified SAPS II score, AKI stage, oxygenation index, and key biochemical markers (e.g., serum sodium and blood urea nitrogen) as primary contributors to mortality risk, while the added value of basic demographic variables was limited. External validation confirmed that the XGBoost model has potential discrimination and robustness, highlighting the robustness and wide applicability of the machine learning-based prognostic framework.

This study established an externally validated and interpretable ML model for riskstratification in S-AKI, enabling early identification of high-risk patients, personalized management strategies, and enhanced clinical outcomes in sepsis care.

## Linked entities

- **Diseases:** chronic kidney disease (MONDO:0005300)

## Full-text entities

- **Genes:** SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, SKAP2 (src kinase associated phosphoprotein 2) [NCBI Gene 8935] {aka PRAP, RA70, SAPS, SCAP2, SKAP-HOM, SKAP55R}, ALB (albumin) [NCBI Gene 213] {aka FDAHT, HSA, PRO0883, PRO0903, PRO1341}
- **Diseases:** chronic lung disease (MESH:D029424), AKI (MESH:D058186), Organ Failure (MESH:D009102), hypoxemia (MESH:D000860), decrease in systolic blood pressure (MESH:D007022), Acute (MESH:D000208), inflammation (MESH:D007249), liver disease (MESH:D008107), critical illness (MESH:D016638), fluid overload (MESH:D019190), CKD (MESH:D051436), decline of renal function (MESH:D060825), Diabetes (MESH:D003920), S (MESH:D018455), Kidney disease (MESH:D007674), congestive heart failure (MESH:D006333), Sepsis (MESH:D018805), immune dysregulation (OMIM:614878), renal cell injury (MESH:D002280), death (MESH:D003643), hypertension (MESH:D006973), II (MESH:C537730), hypothermia (MESH:D007035), microvascular dysfunction (MESH:D017566), metabolic acidosis (MESH:D000138), coagulation disorder (MESH:D001778), infection (MESH:D007239), myocardial infarction (MESH:D009203), chronic renal failure (MESH:D007676)
- **Chemicals:** chloride (MESH:D002712), norepinephrine (MESH:D009638), oxygen (MESH:D010100), bilirubin (MESH:D001663), lactate (MESH:D019344), nitrogen (MESH:D009584), urea nitrogen (MESH:C530477), carbon dioxide (MESH:D002245), creatinine (MESH:D003404), glucose (MESH:D005947), calcium (MESH:D002118), Potassium (MESH:D011188), SA (MESH:D000077145), sodium (MESH:D012964), LightGBM (-), S (MESH:D013455), urea (MESH:D014508)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12946044/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12946044/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/PMC12946044/full.md

---
Source: https://tomesphere.com/paper/PMC12946044