Rectangularization of Gaussian process regression for optimization of hyperparameters
Sergei Manzhos, Manabu Ihara

TL;DR
This paper introduces a rectangularization technique for Gaussian process regression, improving hyperparameter optimization in high-dimensional, sparse data scenarios, demonstrated on molecular potential energy surfaces.
Contribution
The paper proposes a novel rectangularization method that enhances hyperparameter tuning in Gaussian process regression for high-dimensional, sparse datasets.
Findings
Effective hyperparameter tuning with sparse data
Improved GPR performance in high-dimensional spaces
Validated on molecular potential energy surface
Abstract
Gaussian process regression (GPR) is a powerful machine learning method which has recently enjoyed wider use, in particular in physical sciences. In its original formulation, GPR uses a square matrix of covariances among training data and can be viewed as linear regression problem with equal numbers of training data and basis functions. When data are sparse, avoidance of overfitting and optimization of hyperparameters of GPR are difficult, in particular in high-dimensional spaces where the data sparsity issue cannot practically be resolved by adding more data. Optimal choice of hyperparameters, however, determines success or failure of the application of the GPR method. We show that parameter optimization is facilitated by rectangularization of the defining equation of GPR. On the example of a 15-dimensional molecular potential energy surface we demonstrate that this approach allows…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Gaussian Processes and Bayesian Inference · Protein Structure and Dynamics
