# Evaluating the impact of common clinical confounders on performance of deep-learning-based sepsis risk assessment

**Authors:** Shikha Chaganti, Vivek Singh, Alasdair Edward Gent, Rishikesan Kamaleswaran, Ali Kamen

PMC · DOI: 10.3389/frai.2025.1452471 · 2025-07-15

## TL;DR

This paper evaluates how clinical confounders affect a deep-learning model's ability to detect sepsis risk early in emergency departments.

## Contribution

The study introduces a deep-learning model for early sepsis detection and evaluates the impact of label definitions and comorbidities on model performance.

## Key findings

- The consensus-based model achieved 83.7% sensitivity and 80% specificity in identifying sepsis risk within 24 hours.
- Infection-confirmed subgroups showed high PPV (77%), but specificity dropped in cohorts with comorbidities.
- The study highlights limitations of retrospective sepsis definitions in automated detection systems.

## Abstract

Early identification of sepsis in the emergency department using machine learning remains a challenging problem, primarily due to the lack of a gold standard for sepsis diagnosis, the heterogeneity in clinical presentations, and the impact of confounding conditions.

In this work, we present a deep-learning-based predictive model designed to enable early detection of patients at risk of developing sepsis, using data from the first 24 h of admission. The model is based on routine blood test results commonly performed on patients, including CBC (Complete Blood Count), CMP (Comprehensive Metabolic Panel), lipid panels, vital signs, age, and sex. To address the challenge of label uncertainty as a part of the training process, we explore two different definitions, namely, Sepsis-3 and Adult Sepsis Event. We analyze the advantages and limitations of each in the context of patient clinical parameters and comorbidities. We specifically examine how the quality of the ground truth label influences the performance of the deep learning system and evaluate the effect of a consensus-based approach that incorporates both definitions. We also evaluated the model's performance across sub-cohorts, including patients with confounding comorbidities (such as chronic kidney, liver disease, and coagulation disorders) and those with infections confirmed by billing codes.

Our results show that the consensus-based model identifies at-risk patients in the first 24 h with 83.7% sensitivity, 80% specificity, 36% PPV, 97% NPV, and an AUC of 0.9. Our cohort-wise analysis revealed a high PPV (77%) in infection-confirmed subgroups and a drop in specificity across cohorts with confounding comorbidities (47-70%).

This work highlights the limitations of retrospective sepsis definitions and underscores the need for tailored approaches in automated sepsis detection, particularly when dealing with patients with confounding comorbidities.

## Linked entities

- **Diseases:** chronic kidney disease (MONDO:0005300), liver disease (MONDO:0005154)

## Full-text entities

- **Genes:** SEPTIN1 (septin 1) [NCBI Gene 1731] {aka DIFF6, LARP, PNUTL3, SEP1, SEPT1, Septin-1}, ITIH3 (inter-alpha-trypsin inhibitor heavy chain 3) [NCBI Gene 3699] {aka H3P, ITI-HC3, SHAP}
- **Diseases:** fever (MESH:D005334), SIRS (MESH:D018746), acute organ failures (MESH:D058186), hemorrhage (MESH:D006470), Infection (MESH:D007239), Organ Failure (MESH:D009102), tachypnea (MESH:D059246), trauma (MESH:D014947), inflammatory (MESH:D007249), coagulation (MESH:D001778), chronic kidney disease (MESH:D051436), died (MESH:D003643), ASE (MESH:D018805), hypocalcemia (MESH:D006996), liver cancer (MESH:D006528), platelet count (MESH:D011225), tachycardia (MESH:D013610), fatigue (MESH:D005221), mental disorders (MESH:D001523), hypothermia (MESH:D007035), liver disease (MESH:D008107)
- **Chemicals:** bicarbonate (MESH:D001639), Calcium (MESH:D002118), glucose (MESH:D005947), ASE (-), Creatinine (MESH:D003404), bilirubin (MESH:D001663), lipid (MESH:D008055)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12305701/full.md

---
Source: https://tomesphere.com/paper/PMC12305701