Tracking disease outbreaks from sparse data with Bayesian inference
Bryan Wilder, Michael J. Mina, Milind Tambe

TL;DR
This paper introduces a Bayesian inference framework for estimating the time-varying reproduction number of disease outbreaks from sparse, partially observed data, accommodating various testing schemes and improving accuracy over standard methods.
Contribution
It develops a novel Bayesian model with Gaussian process priors and a stochastic variational inference method to handle large-scale latent variables in outbreak data analysis.
Findings
Accurately estimates reproduction number from sparse data
Handles diverse testing schemes and partial observability
Produces well-calibrated posterior estimates
Abstract
The COVID-19 pandemic provides new motivation for a classic problem in epidemiology: estimating the empirical rate of transmission during an outbreak (formally, the time-varying reproduction number) from case counts. While standard methods exist, they work best at coarse-grained national or state scales with abundant data, and struggle to accommodate the partial observability and sparse data common at finer scales (e.g., individual schools or towns). For example, case counts may be sparse when only a small fraction of infections are caught by a testing program. Or, whether an infected individual tests positive may depend on the kind of test and the point in time when they are tested. We propose a Bayesian framework which accommodates partial observability in a principled manner. Our model places a Gaussian process prior over the unknown reproduction number at each time step and models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsCOVID-19 epidemiological studies · SARS-CoV-2 and COVID-19 Research · Influenza Virus Research Studies
MethodsGaussian Process
