Estimating the epidemic risk using non-uniformly sampled contact data
Julie Fournet, Alain Barrat

TL;DR
This paper develops methods to correct biases in epidemic risk estimates caused by non-uniform contact data sampling, improving accuracy especially in structured populations, with implications for better epidemic modeling.
Contribution
It introduces two surrogate data generation methods to mitigate bias from non-uniform sampling in contact networks, enhancing epidemic risk estimation accuracy.
Findings
Surrogate data improves epidemic risk estimates over raw sampled data.
The second method performs well in strongly structured populations.
Limitations arise in populations with weak group structures.
Abstract
Many datasets describing contacts in a population suffer from incompleteness due to population sampling and underreporting of contacts. Data-driven simulations of spreading processes using such incomplete data lead to an underestimation of the epidemic risk, and it is therefore important to devise methods to correct this bias. We focus here on a non-uniform sampling of the contacts between individuals, aimed at mimicking the results of diaries or surveys, and consider as case studies two datasets collected in different contexts. We show that using surrogate data built using a method developed in the case of uniform population sampling yields an improvement with respect to the use of the sampled data but is strongly limited by the underestimation of the link density in the sampled network. We put forward a second method to build surrogate data that assumes knowledge of the density of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
