Scaling Hawkes processes to one million COVID-19 cases
Seyoon Ko, Marc A. Suchard, Andrew J. Holbrook

TL;DR
This paper develops a high-performance computing framework to efficiently analyze one million COVID-19 cases using spatiotemporal Hawkes process models, enabling scalable inference and proposing simpler alternatives to existing methods.
Contribution
It introduces a HPC strategy dividing Markov chains across multiple GPUs for large-scale Hawkes process inference on COVID-19 data, and proposes scalable alternatives for small data settings.
Findings
Successfully analyzed one million COVID-19 cases with HPC methods.
Developed scalable strategies avoiding costly nearest neighbor searches.
Implemented cut-posterior inference to handle spatial uncertainty efficiently.
Abstract
Hawkes stochastic point process models have emerged as valuable statistical tools for analyzing viral contagion. The spatiotemporal Hawkes process characterizes the speeds at which viruses spread within human populations. Unfortunately, likelihood-based inference using these models requires floating-point operations, for the number of observed cases. Recent work responds to the Hawkes likelihood's computational burden by developing efficient graphics processing unit (GPU)-based routines that enable Bayesian analysis of tens-of-thousands of observations. We build on this work and develop a high-performance computing (HPC) strategy that divides 30 Markov chains between 4 GPU nodes, each of which uses multiple GPUs to accelerate its chain's likelihood computations. We use this framework to apply two spatiotemporal Hawkes models to the analysis of one million COVID-19 cases in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoint processes and geometric inequalities
