Distributed Variational Inference in Sparse Gaussian Process Regression   and Latent Variable Models

Yarin Gal; Mark van der Wilk; Carl E. Rasmussen

arXiv:1402.1389·stat.ML·October 1, 2014·72 cites

Distributed Variational Inference in Sparse Gaussian Process Regression and Latent Variable Models

Yarin Gal, Mark van der Wilk, Carl E. Rasmussen

PDF

Open Access 1 Repo

TL;DR

This paper presents a distributed variational inference method for sparse Gaussian process models, enabling scalable and efficient analysis of large datasets while maintaining model performance.

Contribution

It introduces a novel re-parametrisation that allows Gaussian process inference to be distributed using Map-Reduce, improving scalability for big data applications.

Findings

01

Scales well with data and computational resources.

02

Improves GP performance with increasing data size.

03

Outperforms common models on large datasets.

Abstract

Gaussian processes (GPs) are a powerful tool for probabilistic inference over functions. They have been applied to both regression and non-linear dimensionality reduction, and offer desirable properties such as uncertainty estimates, robustness to over-fitting, and principled ways for tuning hyper-parameters. However the scalability of these models to big datasets remains an active topic of research. We introduce a novel re-parametrisation of variational inference for sparse GP regression and latent variable models that allows for an efficient distributed algorithm. This is done by exploiting the decoupling of the data given the inducing points to re-formulate the evidence lower bound in a Map-Reduce setting. We show that the inference scales well with data and computational resources, while preserving a balanced distribution of the load among the nodes. We further demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

markvdw/GParML
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Control Systems and Identification · Advanced Multi-Objective Optimization Algorithms