MuyGPs: Scalable Gaussian Process Hyperparameter Estimation Using Local Cross-Validation
Amanda Muyskens, Benjamin Priest, Im\`ene Goumiri, and Michael, Schneider

TL;DR
MuyGPs introduces a scalable Gaussian process hyperparameter estimation technique that leverages local cross-validation and nearest neighbors, significantly improving efficiency and accuracy over existing methods in large spatial datasets.
Contribution
The paper presents MuyGPs, a novel hyperparameter estimation method for Gaussian processes that avoids expensive likelihood calculations by using local cross-validation with nearest neighbors.
Findings
Outperforms state-of-the-art methods in speed and accuracy.
Reduces computational complexity for large datasets.
Achieves lower root mean squared error in spatial predictions.
Abstract
Gaussian processes (GPs) are non-linear probabilistic models popular in many applications. However, na\"ive GP realizations require quadratic memory to store the covariance matrix and cubic computation to perform inference or evaluate the likelihood function. These bottlenecks have driven much investment in the development of approximate GP alternatives that scale to the large data sizes common in modern data-driven applications. We present in this manuscript MuyGPs, a novel efficient GP hyperparameter estimation method. MuyGPs builds upon prior methods that take advantage of the nearest neighbors structure of the data, and uses leave-one-out cross-validation to optimize covariance (kernel) hyperparameters without realizing a possibly expensive likelihood. We describe our model and methods in detail, and compare our implementations against the state-of-the-art competitors in a benchmark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms
