GPyTorch: Blackbox Matrix-Matrix Gaussian Process Inference with GPU Acceleration
Jacob R. Gardner, Geoff Pleiss, David Bindel, Kilian Q. Weinberger,, Andrew Gordon Wilson

TL;DR
GPyTorch introduces a GPU-accelerated, blackbox matrix-matrix approach for Gaussian process inference that significantly reduces computational complexity and accelerates both exact and approximate inference methods.
Contribution
The paper presents BBMM, a novel GPU-based inference method for GPs that reduces complexity and improves speed, implemented in the GPyTorch software platform.
Findings
BBMM reduces GP inference complexity from O(n^3) to O(n^2).
GPU acceleration dramatically speeds up exact and approximate GP inference.
GPyTorch enables scalable Gaussian process inference with hardware acceleration.
Abstract
Despite advances in scalable models, the inference tools used for Gaussian processes (GPs) have yet to fully capitalize on developments in computing hardware. We present an efficient and general approach to GP inference based on Blackbox Matrix-Matrix multiplication (BBMM). BBMM inference uses a modified batched version of the conjugate gradients algorithm to derive all terms for training and inference in a single call. BBMM reduces the asymptotic complexity of exact GP inference from to . Adapting this algorithm to scalable approximations and complex GP models simply requires a routine for efficient matrix-matrix multiplication with the kernel and its derivative. In addition, BBMM uses a specialized preconditioner to substantially speed up convergence. In experiments we show that BBMM effectively uses GPU hardware to dramatically accelerate both exact GP inference and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Control Systems and Identification · Neural Networks and Applications
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
