Using digital traces to build prospective and real-time county-level early warning systems to anticipate COVID-19 outbreaks in the United States
Lucas M. Stolerman, Leonardo Clemente, Canelle Poirier, Kris V. Parag,, Atreyee Majumder, Serge Masyn, Bernd Resch, and Mauricio Santillana

TL;DR
This paper develops machine learning models using digital traces like search data and social media to predict COVID-19 outbreaks at the US county level in real-time, enabling earlier interventions.
Contribution
It introduces a novel real-time, county-level early warning system leveraging diverse digital data sources, improving local outbreak prediction accuracy.
Findings
Predicted COVID-19 surges 1-6 weeks in advance
Effective in 97 US counties tested in real-time
Outperformed baseline models in outbreak anticipation
Abstract
The ongoing COVID-19 pandemic continues to affect communities around the world. To date, almost 6 million people have died as a consequence of COVID-19, and more than one-quarter of a billion people are estimated to have been infected worldwide. The design of appropriate and timely mitigation strategies to curb the effects of this and future disease outbreaks requires close monitoring of their spatio-temporal trajectories. We present machine learning methods to anticipate sharp increases in COVID-19 activity in US counties in real-time. Our methods leverage Internet-based digital traces -- e.g., disease-related Internet search activity from the general population and clinicians, disease-relevant Twitter micro-blogs, and outbreak trajectories from neighboring locations -- to monitor potential changes in population-level health trends. Motivated by the need for finer spatial-resolution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData-Driven Disease Surveillance · COVID-19 epidemiological studies · Influenza Virus Research Studies
