Discrepancies in Epidemiological Modeling of Aggregated Heterogeneous Data
Anna L. Trella, Peniel N. Argaw, Michelle M. Li, James A. Hay

TL;DR
This paper demonstrates that epidemiological models often fail to accurately estimate parameters when applied to aggregated, heterogeneous data, highlighting the importance of accounting for complex, multi-source epidemic dynamics.
Contribution
The study empirically evaluates the limitations of current Bayesian inference methods on aggregated epidemic data, revealing biases and challenges in modeling complex outbreak scenarios.
Findings
Simple models produce biased estimates for individual epidemics.
Gaussian process models better capture complex trajectories with sufficient data.
Neglecting heterogeneity can mask underlying epidemic dynamics.
Abstract
Within epidemiological modeling, the majority of analyses assume a single epidemic process for generating ground-truth data. However, this assumed data generation process can be unrealistic, since data sources for epidemics are often aggregated across geographic regions and communities. As a result, state-of-the-art models for estimating epidemiological parameters, e.g.~transmission rates, can be inappropriate when faced with complex systems. Our work empirically demonstrates some limitations of applying epidemiological models to aggregated datasets. We generate three complex outbreak scenarios by combining incidence curves from multiple epidemics that are independently simulated via SEIR models with different sets of parameters. Using these scenarios, we assess the robustness of a state-of-the-art Bayesian inference method that estimates the epidemic trajectory from viral load…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 epidemiological studies · Data-Driven Disease Surveillance · Gaussian Processes and Bayesian Inference
MethodsGaussian Process
