# Stage prediction of acute kidney injury in sepsis patients using explainable machine learning approaches

**Authors:** Zhen Quan, Zheng Han, Siyao Zeng, Lianghe Wen, Jingkai Wang, Yue Li, Hongliang Wang

PMC · DOI: 10.3389/fmed.2025.1667488 · Frontiers in Medicine · 2025-10-15

## TL;DR

This study uses machine learning to predict acute kidney injury stages in sepsis patients, aiming to improve early intervention and outcomes.

## Contribution

The novel contribution is developing and evaluating explainable machine learning models for AKI stage prediction in sepsis patients using SHAP for interpretability.

## Key findings

- The Random Forest model achieved an average AUC score of 0.89 for predicting AKI stages.
- SHAP analysis identified urine output, BMI, SOFA score, and blood urea nitrogen as key risk factors.

## Abstract

Acute kidney injury (AKI) is a prevalent and serious complication among sepsis patients, closely associated with high mortality rates and substantial disease burden. Early prediction of AKI is vital for prompt and effective intervention and improved prognosis. This research seeks to construct and assess forecasting frameworks that leverage advanced machine learning algorithms to anticipate AKI progression in high-risk sepsis patients.

This study utilized the MIMIC-IV database, a large, publicly available critical care dataset containing comprehensive, de-identified electronic health records of over 70,000 ICU admissions at Beth Israel Deaconess Medical Center, to extract sepsis patient data for model training and test. Following feature selection, various machine learning algorithms were employed, including Decision Tree (DT), Efficient Neural Network (ENet), k-Nearest Neighbor (KNN), Light Gradient Boosting Machine (LightGBM), Multi-Layer Perceptron (MLP), Multinomial Mixture Model (Multinom), Random Forest (RF), and eXtreme Gradient Boosting (XGBoost). A five-fold cross-test strategy was implemented to minimize bias and assess model performance. SHapley Additive exPlanations (SHAP) was used to interpret the results.

A total of 6,866 critically ill sepsis patients were analyzed, of whom 5,896 developed AKI during hospitalization The RF model demonstrated superior performance, attaining an average AUC score of 0.89 on the ROC curve. SHAP analysis provided detailed insights into feature importance, including urine output, BMI, SOFA score, and maximum blood urea nitrogen, enhancing the clinical applicability of the model.

The machine learning models developed in this study effectively predicted the stages of AKI in severely ill sepsis patients, with the Random Forest model demonstrating optimal performance. SHAP analysis offered crucial insights into the risk factors, facilitating timely and personalized interventions within a clinical setting. Additional multi-center research is essential to confirm the validity of these findings and to ultimately improve patient outcomes and quality of life.

## Linked entities

- **Diseases:** acute kidney injury (MONDO:0002492)

## Full-text entities

- **Diseases:** sepsis (MESH:D018805), critically ill sepsis (MESH:D016638), AKI (MESH:D058186)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12568512/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12568512/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/PMC12568512/full.md

---
Source: https://tomesphere.com/paper/PMC12568512