Scalable Gaussian-process regression and variable selection using Vecchia approximations
Jian Cao, Joseph Guinness, Marc G. Genton, Matthias Katzfuss

TL;DR
This paper introduces VGPR, a scalable Gaussian process regression method that uses Vecchia approximations for efficient variable selection and uncertainty quantification in large datasets.
Contribution
The paper presents a novel scalable algorithm for Gaussian process regression with variable selection, leveraging Vecchia approximations and a new quadratic constrained coordinate descent method.
Findings
VGPR scales to millions of responses and thousands of covariates.
The method improves accuracy over existing approaches.
Theoretical analysis confirms scalability and effectiveness.
Abstract
Gaussian process (GP) regression is a flexible, nonparametric approach to regression that naturally quantifies uncertainty. In many applications, the number of responses and covariates are both large, and a goal is to select covariates that are related to the response. For this setting, we propose a novel, scalable algorithm, coined VGPR, which optimizes a penalized GP log-likelihood based on the Vecchia GP approximation, an ordered conditional approximation from spatial statistics that implies a sparse Cholesky factor of the precision matrix. We traverse the regularization path from strong to weak penalization, sequentially adding candidate covariates based on the gradient of the log-likelihood and deselecting irrelevant covariates via a new quadratic constrained coordinate descent algorithm. We propose Vecchia-based mini-batch subsampling, which provides unbiased gradient estimators.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Spectroscopy and Chemometric Analyses · Machine Learning in Materials Science
