ProSpar-GP: scalable Gaussian process modeling with massive   non-stationary datasets

Kevin Li; Simon Mak

arXiv:2311.08752·stat.ME·November 16, 2023·J. Comput. Graph. Stat.·1 cites

ProSpar-GP: scalable Gaussian process modeling with massive non-stationary datasets

Kevin Li, Simon Mak

PDF

Open Access

TL;DR

ProSpar-GP introduces a scalable, non-stationary Gaussian process model using a product-of-experts approach, variational inference, and GPU acceleration, effectively handling massive datasets with complex local variations.

Contribution

It proposes a novel ProSpar-GP method that models non-stationary data with a product-of-experts formulation and a new variational inference approach, enhancing scalability and accuracy.

Findings

01

Outperforms existing methods on benchmark datasets

02

Efficiently handles massive non-stationary data with GPU acceleration

03

Provides stable, Kolmogorov-consistent stochastic process modeling

Abstract

Gaussian processes (GPs) are a popular class of Bayesian nonparametric models, but its training can be computationally burdensome for massive training datasets. While there has been notable work on scaling up these models for big data, existing methods typically rely on a stationary GP assumption for approximation, and can thus perform poorly when the underlying response surface is non-stationary, i.e., it has some regions of rapid change and other regions with little change. Such non-stationarity is, however, ubiquitous in real-world problems, including our motivating application for surrogate modeling of computer experiments. We thus propose a new Product of Sparse GP (ProSpar-GP) method for scalable GP modeling with massive non-stationary data. The ProSpar-GP makes use of a carefully-constructed product-of-experts formulation of sparse GP experts, where different experts are placed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Machine Learning and Data Classification