Scalable Gaussian Process Inference with Stan
Till Hoffmann, Jukka-Pekka Onnela

TL;DR
This paper presents scalable methods for Gaussian process inference in Stan, including sparse approximations and FFT-based exact methods, enabling fast inference on large datasets with practical guidance and real-world examples.
Contribution
Introduces two scalable Gaussian process inference methods in Stan, one sparse and one exact using FFT, with implementation details and practical guidance.
Findings
Full posterior inference on 10,000 data points in under 20 seconds
Benchmark results guide method selection for practitioners
Implementation supports Python and R interfaces
Abstract
Gaussian processes (GPs) are sophisticated distributions to model functional data. Whilst theoretically appealing, they are computationally cumbersome except for small datasets. We implement two methods for scaling GP inference in Stan: First, a general sparse approximation using a directed acyclic dependency graph; second, a fast, exact method for regularly spaced data modeled by GPs with stationary kernels using the fast Fourier transform. Based on benchmark experiments, we offer guidance for practitioners to decide between different methods and parameterizations. We consider two real-world examples to illustrate the package. The implementation follows Stan's design and exposes performant inference through a familiar interface. Full posterior inference for ten thousand data points is feasible on a laptop in less than 20 seconds. Details on how to get started using the popular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference
