Splitting Gaussian Process Regression for Streaming Data

Nick Terry; Youngjun Choe

arXiv:2010.02424·stat.ML·August 26, 2021

Splitting Gaussian Process Regression for Streaming Data

Nick Terry, Youngjun Choe

PDF

TL;DR

This paper introduces a novel local Gaussian process regression method designed for streaming data, significantly improving scalability by partitioning input space and maintaining linear memory complexity.

Contribution

The paper presents a new sequential partitioning algorithm for Gaussian processes that achieves linear memory use and better time complexity, suitable for streaming data applications.

Findings

01

Outperforms existing methods in time and space complexity

02

Achieves linear memory complexity for local Gaussian process models

03

Proven theoretical continuity properties of the model

Abstract

Gaussian processes offer a flexible kernel method for regression. While Gaussian processes have many useful theoretical properties and have proven practically useful, they suffer from poor scaling in the number of observations. In particular, the cubic time complexity of updating standard Gaussian process models make them generally unsuitable for application to streaming data. We propose an algorithm for sequentially partitioning the input space and fitting a localized Gaussian process to each disjoint region. The algorithm is shown to have superior time and space complexity to existing methods, and its sequential nature permits application to streaming data. The algorithm constructs a model for which the time complexity of updating is tightly bounded above by a pre-specified parameter. To the best of our knowledge, the model is the first local Gaussian process regression model to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGaussian Process