# Baseline predictors for 28-day COVID-19 severity and mortality among hospitalized patients: results from the IMPACC study

**Authors:** Jintong Hou, Benjamin Haslund-Gourley, Joann Diray-Arce, Annmarie Hoch, Nadine Rouphael, Patrice M. Becker, Alison D. Augustine, Al Ozonoff, Leying Guan, Steven H. Kleinstein, Bjoern Peters, Elaine Reed, Matt Altman, Charles R. Langelier, Holden Maecker, Seunghee Kim, Ruth R. Montgomery, Florian Krammer, Michael Wilson, Walter Eckalbar, Steven E. Bosinger, Ofer Levy, Hanno Steen, Lindsey B. Rosen, Lindsey R. Baden, Esther Melamed, Lauren I. R. Ehrlich, Grace A. McComsey, Rafick P. Sekaly, Joanna Schaenman, Albert C. Shaw, David A. Hafler, David B. Corry, Farrah Kheradmand, Mark A. Atkinson, Scott C. Brakenridge, Nelson I. Agudelo Higuita, Jordan P. Metcalf, Catherine L. Hough, William B. Messer, Bali Pulendran, Kari C. Nadeau, Mark M. Davis, Ana Fernandez Sesma, Viviana Simon, Monica Kraft, Chris Bime, Carolyn S. Calfee, David J. Erle, IMPACC Network, Lucy F. Robinson, Charles B. Cairns, Elias K. Haddad, Mary Ann Comunale

PMC · DOI: 10.3389/fmed.2025.1604388 · Frontiers in Medicine · 2025-07-04

## TL;DR

This study identifies baseline predictors for severe illness and death in hospitalized COVID-19 patients using clinical and lab data.

## Contribution

The study introduces machine learning models that outperform existing scores for predicting disease severity and mortality.

## Key findings

- The SpO2/FiO2 ratio is the strongest predictor for both severity and mortality.
- Adding biomarkers like IL-6 and TNFRSF11B modestly improves prediction accuracy.
- The models outperform the SOFA score for inpatients and match it for ICU patients.

## Abstract

The coronavirus disease 2019 (COVID-19) pandemic threatened public health and placed a significant burden on medical resources. The Immunophenotyping Assessment in a COVID-19 Cohort (IMPACC) study collected clinical, demographic, blood cytometry, serum receptor-binding domain (RBD) antibody titers, metabolomics, targeted proteomics, nasal metagenomics, Olink, nasal viral load, autoantibody, SARS-CoV-2 antibody titers, and nasal and peripheral blood mononuclear cell (PBMC) transcriptomics data from patients hospitalized with COVID-19. The aim of this study is to select baseline biomarkers and build predictive models for 28-day in-hospital COVID-19 severity and mortality with most predictive variables while prioritizing routinely collected variables.

We analyzed 1102 hospitalized COVID-19 participants. We used the lasso and forward selection to select top predictors for severity and mortality, and built predictive models based on balanced training data. We then validated the models on testing data.

Severity was best predicted by the baseline SpO2/FiO2 ratio obtained from COVID-19 patients (test AUC: 0.874). Adding patient age, BMI, FGF23, IL-6, and LTA to the disease severity prediction model improves the test AUC by an additional 3%. The clinical mortality prediction model using SpO2/FiO2 ratio, age, and BMI resulted in a test AUC of 0.83. Adding laboratory results such as TNFRSF11B and plasma ribitol count increased the prediction model by 3.5%. The severity and mortality prediction models developed outperform the Sequential Organ Failure Assessment (SOFA) score among inpatients and perform similarly to the SOFA score among ICU patients.

This study identifies clinical data and laboratory biomarkers of COVID-19 severity and mortality using machine learning models. The study identifies SpO2/FiO2 ratio to be the most important predictor for both severity and mortality. Several biomarkers were identified to modestly improve the predictions. The results also provide a baseline of SARS-CoV-2 infection during the early stages of the coronavirus emergence and can serve as a baseline for future studies that inform how the genetic evolution of the coronavirus affects the host response to new variants.

## Linked entities

- **Proteins:** IL6 (interleukin 6), LTA (lymphotoxin alpha), TNFRSF11B (TNF receptor superfamily member 11b), FGF23 (fibroblast growth factor 23)
- **Diseases:** COVID-19 (MONDO:0100096)

## Full-text entities

- **Genes:** TNFRSF11B (TNF receptor superfamily member 11b) [NCBI Gene 4982] {aka OCIF, OPG, PDB5, TR1}, FGF23 (fibroblast growth factor 23) [NCBI Gene 8074] {aka ADHR, FGFN, HFTC2, HPDR2, HYPF, PHPTC}, IL6 (interleukin 6) [NCBI Gene 3569] {aka BSF-2, BSF2, CDF, HGF, HSF, IFN-beta-2}
- **Diseases:** COVID-19 (MESH:D000086382)
- **Chemicals:** ribitol (MESH:D012255), LTA (MESH:D017572)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Homo sapiens (human, species) [taxon 9606], Gammacoronavirus (genus) [taxon 694013]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12271175/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12271175/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/PMC12271175/full.md

---
Source: https://tomesphere.com/paper/PMC12271175