Understanding Probabilistic Sparse Gaussian Process Approximations

Matthias Bauer; Mark van der Wilk; Carl Edward Rasmussen

arXiv:1606.04820·stat.ML·May 31, 2017·NeurIPS·65 cites

Understanding Probabilistic Sparse Gaussian Process Approximations

Matthias Bauer, Mark van der Wilk, Carl Edward Rasmussen

PDF

Open Access

TL;DR

This paper compares two popular sparse Gaussian Process approximation methods, FITC and VFE, analyzing their theoretical differences and practical behaviors to guide better application in large-scale regression tasks.

Contribution

It provides a thorough analytical and empirical comparison of FITC and VFE, clarifying their differences and practical implications for Gaussian Process regression.

Findings

01

FITC and VFE have different theoretical properties.

02

The two methods behave differently in practice.

03

Guidelines for choosing between FITC and VFE.

Abstract

Good sparse approximations are essential for practical inference in Gaussian Processes as the computational cost of exact methods is prohibitive for large datasets. The Fully Independent Training Conditional (FITC) and the Variational Free Energy (VFE) approximations are two recent popular methods. Despite superficial similarities, these approximations have surprisingly different theoretical properties and behave differently in practice. We thoroughly investigate the two methods for regression both analytically and through illustrative examples, and draw conclusions to guide practical application.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Control Systems and Identification