# Prediction models of COVID-19 fatality in nine Peruvian provinces: A secondary analysis of the national epidemiological surveillance system

**Authors:** Wendy Nieto-Gutierrez, Jaid Campos-Chambergo, Enrique Gonzalez-Ayala, Oswaldo Oyola-Garcia, Alberti Alejandro-Mora, Eliana Luis-Aguirre, Roly Pasquel-Santillan, Juan Leiva-Aguirre, Cesar Ugarte-Gil, Steev Loyola

PMC · DOI: 10.1371/journal.pgph.0002854 · PLOS Global Public Health · 2024-01-29

## TL;DR

This study developed and validated models to predict COVID-19 fatality in nine Peruvian provinces using national surveillance data to help guide public health decisions.

## Contribution

The study introduces and compares four variable selection strategies for predicting COVID-19 fatality in a Peruvian population.

## Key findings

- Four prediction models were developed and validated using data from 22,098 cases.
- Models from strategies 1 and 4 showed strong performance with AUCs of 0.89 and 0.88, respectively.
- The models were robust in validation and sensitivity analyses.

## Abstract

There are initiatives to promote the creation of predictive COVID-19 fatality models to assist decision-makers. The study aimed to develop prediction models for COVID-19 fatality using population data recorded in the national epidemiological surveillance system of Peru. A retrospective cohort study was conducted (March to September of 2020). The study population consisted of confirmed COVID-19 cases reported in the surveillance system of nine provinces of Lima, Peru. A random sample of 80% of the study population was selected, and four prediction models were constructed using four different strategies to select variables: 1) previously analyzed variables in machine learning models; 2) based on the LASSO method; 3) based on significance; and 4) based on a post-hoc approach with variables consistently included in the three previous strategies. The internal validation was performed with the remaining 20% of the population. Four prediction models were successfully created and validate using data from 22,098 cases. All models performed adequately and similarly; however, we selected models derived from strategy 1 (AUC 0.89, CI95% 0.87–0.91) and strategy 4 (AUC 0.88, CI95% 0.86–0.90). The performance of both models was robust in validation and sensitivity analyses. This study offers insights into estimating COVID-19 fatality within the Peruvian population. Our findings contribute to the advancement of prediction models for COVID-19 fatality and may aid in identifying individuals at increased risk, enabling targeted interventions to mitigate the disease. Future studies should confirm the performance and validate the usefulness of the models described here under real-world conditions and settings.

## Linked entities

- **Diseases:** COVID-19 (MONDO:0100096)

## Full-text entities

- **Diseases:** Dyspnea (MESH:D004417), diabetes (MESH:D003920), viral infections (MESH:D014777), COVID (MESH:D000086382), organic failure (MESH:D009102), frailty (MESH:D000073496), cough (MESH:D003371), inflammation (MESH:D007249), respiratory infections (MESH:D012141), hypertension (MESH:D006973), Death (MESH:D003643), kidney disease (MESH:D007674), hypoxia (MESH:D000860), respiratory failure (MESH:D012131), infection (MESH:D007239), fever (MESH:D005334), tuberculosis (MESH:D014376), obesity (MESH:D009765), chronic kidney disease (MESH:D051436)
- **Species:** Homo sapiens (human, species) [taxon 9606], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10824411/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC10824411/full.md

## References

65 references — full list in the complete paper: https://tomesphere.com/paper/PMC10824411/full.md

---
Source: https://tomesphere.com/paper/PMC10824411