When Gaussian Process Meets Big Data: A Review of Scalable GPs
Haitao Liu, Yew-Soon Ong, Xiaobo Shen, Jianfei Cai

TL;DR
This paper provides a comprehensive review of scalable Gaussian processes, categorizing methods into global and local approximations, and discusses recent advances, challenges, and future research directions in handling big data.
Contribution
It offers the first detailed survey of scalable GPs, systematically analyzing global and local approximation techniques and highlighting open issues for future research.
Findings
Global approximations include sparse, prior, and posterior methods.
Local approximations involve mixture and product of experts models.
Recent advances improve scalability and prediction quality.
Abstract
The vast quantity of information brought by big data as well as the evolving computer hardware encourages success stories in the machine learning community. In the meanwhile, it poses challenges for the Gaussian process (GP) regression, a well-known non-parametric and interpretable Bayesian model, which suffers from cubic complexity to data size. To improve the scalability while retaining desirable prediction quality, a variety of scalable GPs have been presented. But they have not yet been comprehensively reviewed and analyzed in order to be well understood by both academia and industry. The review of scalable GPs in the GP community is timely and important due to the explosion of data size. To this end, this paper is devoted to the review on state-of-the-art scalable GPs involving two main categories: global approximations which distillate the entire data and local approximations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Control Systems and Identification
MethodsGaussian Process
