Fusing Low-Latency Data Feeds with Death Data to Accurately Nowcast COVID-19 Related Deaths
Conor Rosato, Robert E. Moore, Matthew Carter, John Heap, Jose, Storopoli, Simon Maskell

TL;DR
This paper introduces a machine learning and epidemiological modeling approach that combines social media data with death records to improve real-time COVID-19 death nowcasting accuracy.
Contribution
It presents a novel method that fuses low-latency social media signals with death data using a probabilistic model to enhance COVID-19 death predictions.
Findings
Fusing social media data improves nowcast accuracy.
The model outperforms death-only forecasts.
Real-time social media signals provide valuable early indicators.
Abstract
The emergence of the novel coronavirus (COVID-19) has generated a need to quickly and accurately assemble up-to-date information related to its spread. While it is possible to use deaths to provide a reliable information feed, the latency of data derived from deaths is significant. Confirmed cases derived from positive test results potentially provide a lower latency data feed. However, the sampling of those tested varies with time and the reason for testing is often not recorded. Hospital admissions typically occur around 1-2 weeks after infection and can be considered out of date in relation to the time of initial infection. The extent to which these issues are problematic is likely to vary over time and between countries. We use a machine learning algorithm for natural language processing, trained in multiple languages, to identify symptomatic individuals derived from social media…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · COVID-19 epidemiological studies · Data-Driven Disease Surveillance
