Pre-training Epidemic Time Series Forecasters with Compartmental Prototypes
Zewen Liu, Juntong Ni, Bohan Wang, Max S. Y. Lau, Wei Jin

TL;DR
CAPE is a novel pre-trained epidemic forecasting model that leverages compartmental prototypes from simulation data to improve zero-shot predictions across multiple diseases, addressing data scarcity and distribution shifts.
Contribution
It introduces the first open-source pre-trained epidemic model using compartmental prototypes, bridging epidemiological knowledge with time series forecasting.
Findings
Outperforms strong baselines on 17 disease benchmarks
Enables zero-shot epidemic forecasting with improved accuracy
Models epidemic dynamics as mixtures of latent compartmental states
Abstract
Accurate epidemic forecasting is crucial for outbreak preparedness, but existing data-driven models are often brittle. Typically trained on a single pathogen, they struggle with data scarcity during new outbreaks and fail under distribution shifts caused by viral evolution or interventions. However, decades of surveillance data and the design of various compartmental models from diverse diseases offer an untapped source of transferable knowledge. To leverage the collective lessons from history, we propose CAPE, the first open-source pre-trained model for epidemic forecasting. Unlike existing time series foundation models that overlook epidemiological challenges, CAPE models epidemic dynamics as mixtures of latent compartmental population states, termed \textit{compartmental prototypes}. It models a flexible dictionary of compartment prototypes directly from a large collection of…
Peer Reviews
Decision·Submitted to ICLR 2026
1. The motivation for a pre-trained model for epidemic forecasting is important 2. Epidemic regularized used in pre-trained models in novel 3, Results on provided set of diseases is significant and are compared against without pre-trained models
1. Comparison with SOTA epidemic models are lacking (https://arxiv.org/abs/2207.09370 has a few examples) 2. The benchmarks can be expanded with more datasets from wide range of diseases. Eg: see datasets such as (https://www.tycho.pitt.edu/) 3. Are the datasets evaluated on real-time forecasting setup? What is the training validation split. Real-time forecasting setup is more realistic for ongoing epidemics where such models are more useful. 4. Can the model provide probabilistic forecasts? Thi
(1) The adaptation of the Next Generation Matrix (NGM) method to improve the deep learning model looks interesting. (2) There has been a non-trivial effort to engage with the literature in mathematical epidemiology, which is commendable. (3) Paper presentation is good.
(1) The paper presents ideas and theoretical expressions that have already appeared in prior work but fails to cite the corresponding sources, which could mislead a less careful reader into thinking they are novel contributions. For example, the proof of the lower and upper bounds of the Next Generation Matrix (NGM) method is not original – it is a standard linear-algebra result applied to the classical NGM formulation of R_0. However, since the proof is provided without any citation, readers ma
1. The problem formulation is well-founded and accurately captures the key challenges in epidemiological modeling. 2. The proposed algorithm for estimating R0 is convincing, supported by a sound theoretical justification. The experiments are comprehensive, with diverse benchmarks and analyses that demonstrate strong generalization.
1. The work does not provide strong evidence that it effectively addresses non-stationarity in time-series forecasting. The method appears reliable only when the forecasting horizon is shorter than or comparable to the historical window, which does not reflect the harder non-stationary setting. 2. The approach is heavily dependent on the conventional SEIR model and its simulations. If the method relies on such domain-specific priors, it raises the question of why more principled physics-informed
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData-Driven Disease Surveillance · Machine Learning in Healthcare · Statistical Methods in Epidemiology
