# Administrative Data Is Insufficient to Identify Near-Future Critical Illness: A Population-Based Retrospective Cohort Study

**Authors:** Allan Garland, Ruth Ann Marrie, Hannah Wunsch, Marina Yogendran, Daniel Chateau

PMC · DOI: 10.3389/fepid.2022.944216 · Frontiers in Epidemiology · 2022-07-25

## TL;DR

This study found that administrative data is not sufficient to accurately predict near-future critical illness in adults.

## Contribution

The study demonstrates the limitations of using administrative data for predicting critical illness and highlights the need for additional data types.

## Key findings

- Approximately 0.38% of the yearly cohort experienced near-future critical illness.
- Socioeconomic status was the most influential variable in predicting critical illness.
- The model performed well in training data but poorly in test data, indicating overfitting.

## Abstract

Prediction of future critical illness could render it practical to test interventions seeking to avoid or delay the coming event.

Identify adults having >33% probability of near-future critical illness.

Retrospective cohort study, 2013–2015.

Community-dwelling residents of Manitoba, Canada, aged 40–89 years.

The outcome was a near-future critical illness, defined as intensive care unit admission with invasive mechanical ventilation, or non-palliative death occurring 30–180 days after 1 April each year. By dividing the data into training and test cohorts, a Classification and Regression Tree analysis was used to identify subgroups with ≥33% probability of the outcome. We considered 72 predictors including sociodemographics, chronic conditions, frailty, and health care utilization. Sensitivity analysis used logistic regression methods.

Approximately 0.38% of each yearly cohort experienced near-future critical illness. The optimal Tree identified 2,644 mutually exclusive subgroups. Socioeconomic status was the most influential variable, followed by nursing home residency and frailty; age was sixth. In the training data, the model performed well; 41 subgroups containing 493 subjects had ≥33% members who developed the outcome. However, in the test data, those subgroups contained 429 individuals, with 20 (4.7%) experiencing the outcome, which comprised 0.98% of all subjects with the outcome. While logistic regression showed less model overfitting, it likewise failed to achieve the stated objective.

High-fidelity prediction of near-future critical illness among community-dwelling adults was not successful using population-based administrative data. Additional research is needed to ascertain whether the inclusion of additional types of data can achieve this goal.

## Full-text entities

- **Diseases:** death (MESH:D003643), Critical Illness (MESH:D016638), frailty (MESH:D000073496)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10910992/full.md

## References

66 references — full list in the complete paper: https://tomesphere.com/paper/PMC10910992/full.md

---
Source: https://tomesphere.com/paper/PMC10910992