Don't Get Your Kroneckers in a Twist: Gaussian Processes on High-Dimensional Incomplete Grids
Mads Greisen H{\o}jlund, August Smart Lykke-M{\o}ller, Henry Moss, Ove Christiansen

TL;DR
The paper presents CUTS-GPR, a scalable Gaussian process regression method for high-dimensional incomplete grids, enabling efficient Bayesian modeling of complex energy surfaces with billions of data points.
Contribution
Introducing CUTS-GPR, a novel kernel matrix-vector product technique that achieves near-linear scaling, facilitating exact GPR in high-dimensional, large-scale datasets.
Findings
Scalable kernel matrix-vector product demonstrated with billions of data points.
Full GPR computations completed in hours for 447,265 points in 24 dimensions.
Enables Bayesian modeling of high-dimensional potential energy surfaces.
Abstract
We introduce CUTS-GPR, a new method for performing numerically exact Gaussian process regression (GPR) in high-dimensional settings. The key component of CUTS-GPR is an extremely fast kernel matrix-vector product, which exhibits near-linear or even linear scaling with the amount of training data, , and low-order polynomial scaling with dimensionality, . This is obtained by combining an additive kernel with an incomplete grid and exploiting the resulting structure of the kernel matrix. We demonstrate the scalability of the matrix-vector product by running benchmarks with billions of data points and thousands of dimensions. Full GPR calculations, including hyperparameter optimization, are completed in a matter of hours for and . We demonstrate that our CUTS-GPR enables Bayesian modeling of high-dimensional potential energy surfaces - a longstanding challenge in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
