Improving Epidemic Analyses with Privacy-Preserving Integration of Sensitive Data
Zihan Guan, Zhiyuan Zhao, Fengwei Tian, Dung Nguyen, Payel Bhattacharjee, Ravi Tandon, B. Aditya Prakash, Anil Vullikanti

TL;DR
This paper presents DPEpiNN, a privacy-preserving framework combining neural networks and mechanistic models for improved epidemic analysis using sensitive data, ensuring privacy while enhancing predictive accuracy.
Contribution
It introduces DPEpiNN, a novel unified framework that integrates deep learning with mechanistic epidemic models under differential privacy guarantees, supporting multiple epidemic tasks.
Findings
Incorporating sensitive data improves forecast accuracy under privacy constraints.
DPEpiNN outperforms baseline models in epidemic forecasting and nowcasting.
The framework provides reliable $R_t$ estimates while maintaining privacy.
Abstract
Epidemic analyses increasingly rely on heterogeneous datasets, many of which are sensitive and require strong privacy protection. Although differential privacy (DP) has become a standard in machine learning and data sharing, its adoption in epidemiological modeling remains limited. In this work, we introduce DPEpiNN, a unified framework that integrates deep neural networks with a mechanistic SEIRM-based metapopulation model under formal DP guarantees. DPEpiNN supports multiple epidemic tasks (including multi-step forecasting, nowcasting, effective reproduction number estimation, and intervention analysis) within a single differentiable pipeline. The framework jointly learns epidemic parameters from heterogeneous public and sensitive datasets, while ensuring privacy via input perturbation mechanisms. We evaluate DPEpiNN using COVID-19 data from three regions. Results show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 epidemiological studies · Data-Driven Disease Surveillance · Privacy-Preserving Technologies in Data
