Clustering the Nearest Neighbor Gaussian Process
Ashlynn Crisp, Daniel Taylor-Rodriguez, and Andrew O. Finley

TL;DR
This paper introduces the clustered Nearest Neighbor Gaussian Process (cNNGP), a scalable approximation that reduces computational costs for large spatial datasets while maintaining inference accuracy comparable to the original NNGP.
Contribution
The paper proposes the cNNGP, which significantly lowers computational and storage costs of NNGP, enabling efficient modeling of large spatial data with minimal loss of accuracy.
Findings
cNNGP achieves similar inference to NNGP with much less computation.
cNNGP reduces memory requirements for large spatial datasets.
Application to biomass data in Maine demonstrates practical efficiency.
Abstract
Gaussian processes are ubiquitous as the primary tool for modeling spatial data. However, the Gaussian process is limited by its cost, making direct parameter fitting algorithms infeasible for the scale of modern data collection initiatives. The Nearest Neighbor Gaussian Process (NNGP) was introduced as a scalable approximation to dense Gaussian processes which has been successful for observations. This project introduces the (cNNGP) which reduces the computational and storage cost of the NNGP. The accuracy of parameter estimation and reduction in computational and memory storage requirements are demonstrated with simulated data, where the cNNGP provided comparable inference to that obtained with the NNGP, in a fraction of the sampling time. To showcase the method's performance, we modeled biomass…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Anomaly Detection Techniques and Applications · Data Management and Algorithms
