ProSpar-GP: scalable Gaussian process modeling with massive non-stationary datasets
Kevin Li, Simon Mak

TL;DR
ProSpar-GP introduces a scalable, non-stationary Gaussian process model using a product-of-experts approach, variational inference, and GPU acceleration, effectively handling massive datasets with complex local variations.
Contribution
It proposes a novel ProSpar-GP method that models non-stationary data with a product-of-experts formulation and a new variational inference approach, enhancing scalability and accuracy.
Findings
Outperforms existing methods on benchmark datasets
Efficiently handles massive non-stationary data with GPU acceleration
Provides stable, Kolmogorov-consistent stochastic process modeling
Abstract
Gaussian processes (GPs) are a popular class of Bayesian nonparametric models, but its training can be computationally burdensome for massive training datasets. While there has been notable work on scaling up these models for big data, existing methods typically rely on a stationary GP assumption for approximation, and can thus perform poorly when the underlying response surface is non-stationary, i.e., it has some regions of rapid change and other regions with little change. Such non-stationarity is, however, ubiquitous in real-world problems, including our motivating application for surrogate modeling of computer experiments. We thus propose a new Product of Sparse GP (ProSpar-GP) method for scalable GP modeling with massive non-stationary data. The ProSpar-GP makes use of a carefully-constructed product-of-experts formulation of sparse GP experts, where different experts are placed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Machine Learning and Data Classification
