# Personalized ICU mortality assessment by interpretable machine learning algorithms in patients with sepsis combined lung cancer: a population-based study and an external validation cohort

**Authors:** Hongjie Tang, Hairong Hao, Yue Han

PMC · DOI: 10.3389/fonc.2025.1661212 · 2025-10-01

## TL;DR

This study uses machine learning to predict ICU mortality in lung cancer patients with sepsis, offering a personalized and interpretable approach.

## Contribution

A novel interpretable machine learning model for ICU mortality prediction in sepsis patients with lung cancer is developed and validated.

## Key findings

- CatBoost model achieved the highest AUC of 0.931 in the training cohort.
- Oxford Acute Severity of Illness Score (OASIS) was the most influential factor in ICU mortality prediction.
- The model showed good performance in external validation with an AUC of 0.794.

## Abstract

Sepsis is a leading cause of mortality, especially among immunocompromised patients with lung cancer. We aimed to establish machine learning (ML) based model to accurately forecast ICU mortality in patients with sepsis combined lung cancer.

We incorporated patients with sepsis combined lung cancer from Medical Information Mart for Intensive Care IV (MIMIC IV) database. Univariate and multivariate logistic analysis were employed to select variables. Recursive Feature Elimination (RFE) method based on 6 ML algorithms was used for feature selection. We harnessed 13 ML algorithms to construct prediction model, which were assessed by area under the curve (AUC), accuracy, sensitivity, specificity, precision, cross-entropy and Brier scores. The best ML model was constructed to predict ICU mortality, and the predictive results were interpretated by SHapley Additive exPlanations (SHAP) framework.

A sum of 1096 lung cancer patients combined sepsis from MIMIC IV database and 251 patients from the external validation set were included. We utilized 13 clinical variables to establish prediction model for ICU mortality. CatBoost model was identified as the prime prediction model with the highest AUC in the training (0.931 [0.921, 0.945]), internal validation (0.698 [0.673, 0.724]) and external validation (0.794 [0.725, 0.879]) cohorts. Oxford Acute Severity of Illness Score (OASIS) had the greatest influence on ICU mortality according to SHAP interpretation.

Our ML models demonstrate excellent accuracy and reliability, facilitating more rigorous personalized prognostic forecast to lung cancer patients combined sepsis.

## Linked entities

- **Diseases:** lung cancer (MONDO:0005138)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Diseases:** Sepsis (MESH:D018805), lung cancer (MESH:D008175)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12522119/full.md

---
Source: https://tomesphere.com/paper/PMC12522119