Deep Gaussian Processes for Regression using Approximate Expectation Propagation
Thang D. Bui, Daniel Hern\'andez-Lobato, Yingzhen Li, Jos\'e, Miguel Hern\'andez-Lobato, Richard E. Turner

TL;DR
This paper introduces a new approximate Bayesian learning method for Deep Gaussian Processes using Expectation Propagation, enabling scalable non-linear regression with improved accuracy and uncertainty estimation.
Contribution
It develops an efficient approximate Expectation Propagation algorithm for DGPs, allowing application to larger regression problems and outperforming existing methods.
Findings
Outperforms Gaussian process regression on eleven datasets.
Generally better than state-of-the-art Bayesian neural network methods.
Provides a comprehensive analysis of Bayesian training methods for neural networks.
Abstract
Deep Gaussian processes (DGPs) are multi-layer hierarchical generalisations of Gaussian processes (GPs) and are formally equivalent to neural networks with multiple, infinitely wide hidden layers. DGPs are nonparametric probabilistic models and as such are arguably more flexible, have a greater capacity to generalise, and provide better calibrated uncertainty estimates than alternative deep models. This paper develops a new approximate Bayesian learning scheme that enables DGPs to be applied to a range of medium to large scale regression problems for the first time. The new method uses an approximate Expectation Propagation procedure and a novel and efficient extension of the probabilistic backpropagation algorithm for learning. We evaluate the new method for non-linear regression on eleven real-world datasets, showing that it always outperforms GP regression and is almost always better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Target Tracking and Data Fusion in Sensor Networks · Control Systems and Identification
