Exact Gaussian Processes for Massive Datasets via Non-Stationary Sparsity-Discovering Kernels
Marcus M. Noack, Harinarayan Krishnan, Mark D. Risser, Kristofer G., Reyes

TL;DR
This paper introduces a novel approach to Gaussian Processes that leverages naturally-occurring sparsity in data through flexible, non-stationary kernels, enabling exact GP inference on datasets exceeding 5 million points.
Contribution
It proposes a new class of kernels that discover and exploit inherent sparsity, allowing scalable exact Gaussian Process inference without approximation.
Findings
Scales exact GPs to over 5 million data points.
Uses non-stationary, sparsity-discovering kernels.
Achieves high accuracy with large datasets.
Abstract
A Gaussian Process (GP) is a prominent mathematical framework for stochastic function approximation in science and engineering applications. This success is largely attributed to the GP's analytical tractability, robustness, non-parametric structure, and natural inclusion of uncertainty quantification. Unfortunately, the use of exact GPs is prohibitively expensive for large datasets due to their unfavorable numerical complexity of in computation and in storage. All existing methods addressing this issue utilize some form of approximation -- usually considering subsets of the full dataset or finding representative pseudo-points that render the covariance matrix well-structured and sparse. These approximate methods can lead to inaccuracies in function approximations and often limit the user's flexibility in designing expressive kernels. Instead of inducing sparsity via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms
MethodsGaussian Process · Greedy Policy Search
