Inference in Deep Gaussian Processes using Stochastic Gradient Hamiltonian Monte Carlo

Marton Havasi; José Miguel Hernández-Lobato; Juan José Murillo-Fuentes

arXiv:1806.05490·stat.ML·November 13, 2018·39 cites

TL;DR

This paper introduces a novel inference method for Deep Gaussian Processes using Stochastic Gradient Hamiltonian Monte Carlo, which better captures the complex posterior distribution and improves prediction accuracy over traditional Variational Inference.

Contribution

The paper demonstrates the effectiveness of SGHMC for DGP inference and introduces the Moving Window MCEM algorithm for efficient hyperparameter optimization.

Findings

01. SGHMC captures non-Gaussian, multimodal posteriors more effectively than the Gaussian approximation used in Variational Inference.

02. The proposed method outperforms Variational Inference in predictive accuracy.

03. Computational cost is lower than that of the Variational Inference counterpart.

Abstract

Deep Gaussian Processes (DGPs) are hierarchical generalizations of Gaussian Processes that combine well calibrated uncertainty estimates with the high flexibility of multilayer models. One of the biggest challenges with these models is that exact inference is intractable. The current state-of-the-art inference method, Variational Inference (VI), employs a Gaussian approximation to the posterior distribution. This can be a potentially poor unimodal approximation of the generally multimodal posterior. In this work, we provide evidence for the non-Gaussian nature of the posterior and we apply the Stochastic Gradient Hamiltonian Monte Carlo method to generate samples. To efficiently optimize the hyperparameters, we introduce the Moving Window MCEM algorithm. This results in significantly better predictions at a lower computational cost than its VI counterpart. Thus our method establishes a…
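To make the sampling idea in the abstract concrete, the following is a minimal sketch of a Stochastic Gradient Hamiltonian Monte Carlo update on a toy one-dimensional bimodal density. It is not the authors' implementation and does not involve a DGP. The target (an equal mixture of two unit-variance Gaussians at -2 and +2), the step size `eta`, the friction `alpha`, the artificial gradient noise, and all function names are assumptions chosen for illustration. The update is one common simplified form of SGHMC with the gradient-noise estimate set to zero.

```python
import numpy as np


def grad_u(theta):
    """Gradient of U(theta) = -log p(theta) for the toy target
    p(theta) = 0.5 * N(theta; -2, 1) + 0.5 * N(theta; +2, 1).
    The closed form is theta - 2 * tanh(2 * theta)."""
    return theta - 2.0 * np.tanh(2.0 * theta)


def sghmc(n_steps=50_000, eta=0.01, alpha=0.1, grad_noise=0.5, seed=0):
    """Run a simplified SGHMC chain (hypothetical settings, illustration only).

    eta        : step size (assumed value)
    alpha      : friction coefficient (assumed value)
    grad_noise : std of Gaussian noise added to the exact gradient to
                 mimic a minibatch gradient estimate (assumed value)
    """
    rng = np.random.default_rng(seed)
    theta, v = 0.0, 0.0
    samples = np.empty(n_steps)
    for t in range(n_steps):
        # Noisy gradient stands in for a stochastic (minibatch) gradient.
        noisy_grad = grad_u(theta) + grad_noise * rng.normal()
        # Velocity update: friction, gradient step, and injected noise
        # with variance 2 * alpha * eta.
        v = (1.0 - alpha) * v - eta * noisy_grad \
            + np.sqrt(2.0 * alpha * eta) * rng.normal()
        # Position update.
        theta = theta + v
        samples[t] = theta
    return samples


samples = sghmc()
# Fraction of samples in the right-hand mode; a chain that explores both
# modes should spend a substantial share of its time on each side.
frac_right = float(np.mean(samples > 0))
print(f"fraction of samples with theta > 0: {frac_right:.2f}")
```

A single Gaussian fitted to this target would have to either cover both modes with one broad bump or pick one of them, which is the kind of limitation the abstract attributes to a unimodal approximation of a multimodal posterior. The sampler above makes no such assumption about the shape of the distribution.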

Taxonomy

Topics: Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks