Ensemble-labeling of infectious disease time series to evaluate early warning systems
Andreas Hicketier, Moritz Bach, Philip Oedi, Alexander Ullrich, Auss Abbood

TL;DR
The paper introduces a new method to label disease outbreak data, enabling better evaluation and training of early warning systems for infectious diseases like COVID-19.
Contribution
An adaptive ensemble labeling method for heterogeneous disease time series that improves benchmarking and supervised model training.
Findings
The method consistently produces useful outbreak labels for various outbreak types and spatial resolutions.
Supervised models trained with generated labels outperform traditional unsupervised outbreak detection methods.
The approach allows systematic benchmarking of outbreak detection systems on real surveillance data.
Abstract
Early warning systems (EWSs) for detecting disease outbreaks can help make informed public health decisions and organize necessary responses. During the COVID-19 pandemic, several EWSs were proposed that use covariates such as mobility or social media data for improved timeliness and precision. Evaluating these EWSs is not trivial, since we do not have the ground truth knowledge about outbreaks of COVID-19. Workarounds for missing labels are to simulate them or produce them post hoc. Simulating COVID-19 outbreaks for evaluation is not feasible with highly complex covariates such as mobility. Furthermore, existing post hoc labeling methods do not perform well on heterogeneous COVID-19 time series. To address this evaluation gap, we propose an adaptive labeling method that produces useful labels (time-indexed annotations marking outbreak-like periods) for highly heterogeneous,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19
Figure 20
Figure 21Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData-Driven Disease Surveillance · Seismology and Earthquake Studies · COVID-19 epidemiological studies
