TL;DR
FlashSpread is a GPU framework that efficiently simulates complex non-Markovian epidemic models on large contact networks by consolidating multiple steps into a single kernel, achieving significant speedups over CPU methods.
Contribution
It introduces a novel fused kernel approach for GPU-based non-Markovian epidemic simulation, enabling scalable and high-throughput modeling on large networks.
Findings
Achieves 8.09 Giga-NUPS on N=10^6 scale-free graphs with an A100 GPU.
Provides a 217x speedup over optimized CPU tau-leaping methods.
Scales to N=10^8 nodes on a single GPU with extended scale via mixed-precision storage.
Abstract
Non-Markovian (renewal) epidemic simulation on multi-million-node contact networks is essential for realistic forecasting under general age-dependent holding-time distributions (log-normal, Weibull, Erlang, and similar), but the age-dependent hazard forces dense per-step updates that render the sparse event-queue strategies of standard CPU methods ineffective. We present FlashSpread, a GPU framework that consolidates the per-step renewal pipeline (CSR traversal, numerically stable erfcx-based hazard evaluation, Bernoulli tau-leaping, state transition, and next-step infectivity write-back) into a single fused Triton kernel whose intermediates never leave streaming-multiprocessor registers, with block-scalar skips that preserve CUDA Graph capture and a degree-aware CSR dispatch (thread / warp / edge-merge) that keeps the peak throughput on scale-free graphs. On an NVIDIA A100 the fused…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
