A Solution to the Ill-Conditioning of Gradient-Enhanced Covariance Matrices for Gaussian Processes
Andr\'e L. Marchildon, David W. Zingg

TL;DR
This paper introduces a new diagonal preconditioning method with a modest nugget to stabilize gradient-enhanced Gaussian process covariance matrices, improving optimization convergence and avoiding prior drawbacks.
Contribution
A novel preconditioning approach for gradient-enhanced Gaussian processes that bounds the covariance matrix condition number without restricting data or hyperparameters.
Findings
Significantly improved convergence in Bayesian optimization.
Outperforms baseline and rescaling methods in fewer iterations.
Method is implemented in an open-source Python library.
Abstract
Gaussian processes provide probabilistic surrogates for various applications including classification, uncertainty quantification, and optimization. Using a gradient-enhanced covariance matrix can be beneficial since it provides a more accurate surrogate relative to its gradient-free counterpart. An acute problem for Gaussian processes, particularly those that use gradients, is the ill-conditioning of their covariance matrices. Several methods have been developed to address this problem for gradient-enhanced Gaussian processes but they have various drawbacks such as limiting the data that can be used, imposing a minimum distance between evaluation points in the parameter space, or constraining the hyperparameters. In this paper a new method is presented that applies a diagonal preconditioner to the covariance matrix along with a modest nugget to ensure that the condition number of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Spectroscopy and Chemometric Analyses · Target Tracking and Data Fusion in Sensor Networks
