Explainable Machine Learning for Sepsis Outcome Prediction Using a Novel Romanian Electronic Health Record Dataset

Andrei-Alexandru Bunea; Ovidiu Ghibea; Dan-Matei Popovici; Ion Daniel; Octavian Andronic

arXiv:2604.04698·cs.LG·April 7, 2026

Explainable Machine Learning for Sepsis Outcome Prediction Using a Novel Romanian Electronic Health Record Dataset

Andrei-Alexandru Bunea, Ovidiu Ghibea, Dan-Matei Popovici, Ion Daniel, Octavian Andronic

PDF

TL;DR

This study develops explainable machine learning models using a novel Romanian EHR dataset to predict sepsis outcomes, identifying key clinical predictors with high accuracy and interpretability.

Contribution

It introduces a new dataset and demonstrates state-of-the-art predictive models with explainability for sepsis outcomes in Romanian hospitals.

Findings

01

Highest AUC of 0.983 for deceased vs. recovered prediction

02

SHAP analysis identified cardiovascular comorbidities and eosinophil levels as key predictors

03

Eosinopenia is a valuable, underutilized marker in sepsis assessment

Abstract

We develop and analyze explainable machine learning (ML) models for sepsis outcome prediction using a novel Electronic Health Record (EHR) dataset from 12,286 hospitalizations at a large emergency hospital in Romania. The dataset includes demographics, International Classification of Diseases (ICD-10) diagnostics, and 600 types of laboratory tests. This study aims to identify clinically strong predictors while achieving state-of-the-art results across three classification tasks: (1)deceased vs. discharged, (2)deceased vs. recovered, and (3)recovered vs. ameliorated. We trained five ML models to capture complex distributions while preserving clinical interpretability. Experiments explored the trade-off between feature richness and patient coverage, using subsets of the 10--50 most frequent laboratory tests. Model performance was evaluated using accuracy and area under the curve (AUC),…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.