Comparison of new computational methods for geostatistical modelling of malaria
Spencer Wong, Jennifer A. Flegg, Nick Golding, Sevvandi, Kandanaarachchi

TL;DR
This paper compares four fast geostatistical methods for modeling malaria prevalence, highlighting their scalability, accuracy, and sensitivity to assumptions, to guide effective spatial analysis at large scales.
Contribution
It provides an applied comparison of four recent scalable geostatistical methods for malaria data, evaluating their performance and practical considerations.
Findings
INLA and FRK scale well computationally.
SpRF and GPBoost are less scalable for large data.
Model sensitivity requires careful assumption checking.
Abstract
Geostatistical analysis of health data is increasingly used to model spatial variation in malaria prevalence, burden, and other metrics. Traditional inference methods for geostatistical modelling are notoriously computationally intensive, motivating the development of newer, approximate methods. The appeal of faster methods is particularly great as the size of the region and number of spatial locations being modelled increases. Methods We present an applied comparison of four proposed `fast' geostatistical modelling methods and the software provided to implement them -- Integrated Nested Laplace Approximation (INLA), tree boosting with Gaussian processes and mixed effect models (GPBoost), Fixed Rank Kriging (FRK) and Spatial Random Forests (SpRF). We illustrate the four methods by estimating malaria prevalence on two different spatial scales -- country and continent. We compare the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic and phenotypic traits in livestock
