# Clustering of disease trajectories with explainable machine learning: A case study on postoperative delirium phenotypes

**Authors:** Xiaochen Zheng, Ahmed Allam, Manuel Schürch, Xingyu Chen, Maria Angeliki Komninou, Reto Schüpbach, Jan Bartussek, Michael Krauthammer

PMC · DOI: 10.1371/journal.pdig.0001267 · PLOS Digital Health · 2026-03-23

## TL;DR

This paper introduces a new method to identify subtypes of postoperative delirium using machine learning and explainable AI, enabling more personalized treatment strategies.

## Contribution

A novel two-step approach combining risk prediction and SHAP-based clustering to uncover hidden phenotypes in complex diseases like postoperative delirium.

## Key findings

- Clustering patients based on SHAP feature scores successfully recovers true underlying phenotypes in synthetic data.
- Real-world data analysis reveals distinct subgroups of postoperative delirium patients with differing clinical profiles.
- The method outperforms traditional clustering in raw feature space for identifying meaningful disease subtypes.

## Abstract

The identification of phenotypes within complex diseases is a fundamental component of personalized medicine, which aims to adapt healthcare to individual patient characteristics. Postoperative delirium (POD) is a complex neuropsychiatric condition with significant heterogeneity in its clinical manifestations and underlying pathophysiology. We hypothesize that POD comprises several distinct phenotypes, which cannot be directly observed in clinical practice. Identifying these phenotypes could enhance our understanding of POD pathogenesis and facilitate the development of targeted prevention and treatment strategies. In this paper, we propose an approach that combines supervised machine learning for personalized POD risk prediction with unsupervised clustering technique to uncover potential POD phenotypes. We first demonstrate our approach using synthetic data, where we simulate patient cohorts with predefined phenotypes based on distinct sets of informative features. We aim to mimic any clinical disease with our synthetic data generation method. By training a predictive model and computing SHapley Additive exPlanations (SHAP), we show that clustering patients in the SHAP feature scoring space successfully recovers the true underlying phenotypes, outperforming clustering in the raw feature space. We then present a case study using real-world data from a cohort of elderly POD patients. We train machine learning models on heterogeneous electronic health record data covering the preoperative, intraoperative and postoperative stages to predict personalized POD risk. Subsequent clustering of patients based on their SHAP feature scores reveals distinct subgroups with differing clinical characteristics and risk profiles, potentially representing POD phenotypes. These results showcase the utility of our approach in uncovering clinically relevant subtypes of complex disorders like POD, paving the way for more precise and personalized treatment strategies.

Our research addresses how we can identify different subtypes within complex medical conditions using advanced data analysis techniques. While traditional medical approaches often treat conditions like postoperative delirium as uniform entities, we demonstrate that they actually comprise distinct subtypes with unique underlying characteristics. We’ve developed a novel two-step approach that first predicts a patient’s risk using machine learning algorithms, then identifies the specific factors driving that risk for each individual. By grouping patients based on these personalized risk factors rather than their raw medical data, we can uncover meaningful subtypes that weren’t previously apparent. Testing this approach with both synthetic and real-world medical data proved remarkably effective. The method successfully identified distinct patient subgroups with different clinical profiles, potentially representing different forms of postoperative delirium with unique underlying mechanisms. These findings have important implications for patient care. Understanding these subtypes could help clinicians develop more targeted prevention strategies and treatments tailored to a patient’s specific condition variant rather than using a one-size-fits-all approach. This work represents a significant step toward more personalized medicine that recognizes the inherent diversity within complex medical conditions, ultimately improving outcomes through more precise interventions.

## Full-text entities

- **Genes:** ITIH2 (inter-alpha-trypsin inhibitor heavy chain 2) [NCBI Gene 3698] {aka H2P, ITI-HC2, SHAP}
- **Diseases:** neuropsychiatric postoperative complication (MESH:D011183), confusion (MESH:D003221), Delirium (MESH:D003693), dementia (MESH:D003704), autoimmune diseases (MESH:D001327), attention deficits (MESH:D001289), hematological malignancy (MESH:D019337), cardiovascular and respiratory dysfunction (MESH:D018376), COVID-19 (MESH:D000086382), neuropsychiatric and neurological disorders (MESH:D009422), cognitive decline (MESH:D003072), sleep disturbances (MESH:D012893), neuroinflammatory (MESH:D000090862), ICU (MESH:C000657744), neuropsychiatric condition (MESH:D001523), immune system dysfunction (MESH:D007154), Sjogren's syndrome (MESH:D012859), critical illness (MESH:D016638), POD (MESH:D000071257)
- **Chemicals:** dexmedetomidine (MESH:D020927), glucose (MESH:D005947)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13008057/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13008057/full.md

## References

59 references — full list in the complete paper: https://tomesphere.com/paper/PMC13008057/full.md

---
Source: https://tomesphere.com/paper/PMC13008057