A Case Study in Model Failure? COVID-19 Daily Deaths and ICU Bed Utilisation Predictions in New York State
Vincent Chin, Noelle I. Samia, Roman Marchant, Ori Rosen, John P.A., Ioannidis, Martin A. Tanner, Sally Cripps

TL;DR
This study critically evaluates COVID-19 death and ICU bed prediction models in New York, revealing significant inaccuracies and emphasizing the importance of reliable data and real-time validation for trustworthy forecasts.
Contribution
It provides a detailed analysis of prediction inaccuracies and highlights the necessity for rigorous data validation and performance testing of models used in public health decision-making.
Findings
All models performed poorly in prediction accuracy.
Only one model accurately estimated uncertainty, but too late for early decisions.
The IHME ICU model was highly inaccurate until the pandemic waned.
Abstract
Forecasting models have been influential in shaping decision-making in the COVID-19 pandemic. However, there is concern that their predictions may have been misleading. Here, we dissect the predictions made by four models for the daily COVID-19 death counts between March 25 and June 5 in New York state, as well as the predictions of ICU bed utilisation made by the influential IHME model. We evaluated the accuracy of the point estimates and the accuracy of the uncertainty estimates of the model predictions. First, we compared the "ground truth" data sources on daily deaths against which these models were trained. Three different data sources were used by these models, and these had substantial differences in recorded daily death counts. Two additional data sources that we examined also provided different death counts per day. For accuracy of prediction, all models fared very poorly. Only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
