Understanding Probabilistic Sparse Gaussian Process Approximations
Matthias Bauer, Mark van der Wilk, Carl Edward Rasmussen

TL;DR
This paper compares two popular sparse Gaussian Process approximation methods, FITC and VFE, analyzing their theoretical differences and practical behaviors to guide better application in large-scale regression tasks.
Contribution
It provides a thorough analytical and empirical comparison of FITC and VFE, clarifying their differences and practical implications for Gaussian Process regression.
Findings
FITC and VFE have different theoretical properties.
The two methods behave differently in practice.
Guidelines for choosing between FITC and VFE.
Abstract
Good sparse approximations are essential for practical inference in Gaussian Processes as the computational cost of exact methods is prohibitive for large datasets. The Fully Independent Training Conditional (FITC) and the Variational Free Energy (VFE) approximations are two recent popular methods. Despite superficial similarities, these approximations have surprisingly different theoretical properties and behave differently in practice. We thoroughly investigate the two methods for regression both analytically and through illustrative examples, and draw conclusions to guide practical application.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Control Systems and Identification
