TL;DR
This paper introduces a fully natural gradient optimization scheme for heterogeneous multi-output Gaussian process models, improving inference quality and scalability, especially for models with convolution processes.
Contribution
It proposes a novel natural gradient-based inference method that enhances optimization and extends the model to include convolution processes for better flexibility.
Findings
Achieves better local optima with higher test performance.
Improves scalability of convolutional models via stochastic variational inference.
Outperforms adaptive gradient methods in experiments.
Abstract
A recent novel extension of multi-output Gaussian processes handles heterogeneous outputs assuming that each output has its own likelihood function. It uses a vector-valued Gaussian process prior to jointly model all likelihoods' parameters as latent functions drawn from a Gaussian process with a linear model of coregionalisation covariance. By means of an inducing points framework, the model is able to obtain tractable variational bounds amenable to stochastic variational inference. Nonetheless, the strong conditioning between the variational parameters and the hyper-parameters burdens the adaptive gradient optimisation methods used in the original approach. To overcome this issue we borrow ideas from variational optimisation introducing an exploratory distribution over the hyper-parameters, allowing inference together with the posterior's variational parameters through a fully natural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest · Convolution · Adam · Gaussian Process
