# Daily Automated Prediction of Delirium Risk in Hospitalized Patients: Model Development and Validation

**Authors:** Kendrick Matthew Shaw, Yu-Ping Shao, Manohar Ghanta, Valdery Moura Junior, Eyal Y Kimchi, Timothy T Houle, Oluwaseun Akeju, Michael Brandon Westover

PMC · DOI: 10.2196/60442 · JMIR Medical Informatics · 2025-04-18

## TL;DR

This paper presents a machine learning model that predicts delirium risk in hospitalized patients using electronic health records, aiming to improve early detection and intervention.

## Contribution

The novel contribution is the development and validation of an automated delirium risk prediction model using electronic medical record data.

## Key findings

- The boosted tree model achieved the highest predictive accuracy with an AUROC of 0.92.
- The model's performance dropped when predicting risk in patients without prior delirium.
- Electronic health data like lab values and medications were effective predictors of delirium risk.

## Abstract

Delirium is common in hospitalized patients and is correlated with increased morbidity and mortality. Despite this, delirium is underdiagnosed, and many institutions do not have sufficient resources to consistently apply effective screening and prevention.

This study aims to develop a machine learning algorithm to identify patients at the highest risk of delirium in the hospital each day in an automated fashion based on data available in the electronic medical record, reducing the barrier to large-scale delirium screening.

We developed and compared multiple machine learning models on a retrospective dataset of all hospitalized adult patients with recorded Confusion Assessment Method (CAM) screens at a major academic medical center from April 2, 2016, to January 16, 2019, comprising 23,006 patients. The patient’s age, gender, and all available laboratory values, vital signs, prior CAM screens, and medication administrations were used as potential predictors. Four machine learning approaches were investigated: logistic regression with L1-regularization, multilayer perceptrons, random forests, and boosted trees. Model development used 80% of the patients; the remaining 20% was reserved for testing the final models. Laboratory values, vital signs, medications, gender, and age were used to predict a positive CAM screen in the next 24 hours.

The boosted tree model achieved the greatest predictive power, with an area under the receiver operator characteristic curve (AUROC) of 0.92 (95% CI 0.913-9.22), followed by the random forest (AUROC 0.91, 95% CI 0.909-0.918), multilayer perceptron (AUROC 0.86, 95% CI 0.850-0.861), and logistic regression (AUROC 0.85, 95% CI 0.841-0.852). These AUROCs decreased to 0.78-0.82 and 0.74-0.80 when limited to patients who currently do not or never have had delirium, respectively.

A boosted tree machine learning model was able to identify hospitalized patients at elevated risk for delirium in the next 24 hours. This may allow for automated delirium risk screening and more precise targeting of proven and investigational interventions to prevent delirium.

## Linked entities

- **Diseases:** delirium (MONDO:0045057)

## Full-text entities

- **Diseases:** Confusion (MESH:D003221), Delirium (MESH:D003693)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12048784/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12048784/full.md

## References

37 references — full list in the complete paper: https://tomesphere.com/paper/PMC12048784/full.md

---
Source: https://tomesphere.com/paper/PMC12048784