An Improved Variational Approximate Posterior for the Deep Wishart Process
Sebastian Ober, Ben Anson, Edward Milsom, Laurence Aitchison

TL;DR
This paper improves the variational inference method for deep Wishart processes by generalizing the approximate posterior, leading to better predictive performance with minimal additional computational cost.
Contribution
It introduces a generalized variational posterior for the deep Wishart process that enhances predictive accuracy over previous methods.
Findings
Enhanced predictive performance on UCI datasets
Negligible increase in computational cost
Generalization of the Bartlett decomposition improves inference
Abstract
Deep kernel processes are a recently introduced class of deep Bayesian models that have the flexibility of neural networks, but work entirely with Gram matrices. They operate by alternately sampling a Gram matrix from a distribution over positive semi-definite matrices, and applying a deterministic transformation. When the distribution is chosen to be Wishart, the model is called a deep Wishart process (DWP). This particular model is of interest because its prior is equivalent to a deep Gaussian process (DGP) prior, but at the same time it is invariant to rotational symmetries, leading to a simpler posterior distribution. Practical inference in the DWP was made possible in recent work ("A variational approximate posterior for the deep Wishart process" Ober and Aitchison 2021a) where the authors used a generalisation of the Bartlett decomposition of the Wishart distribution as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Advanced Statistical Methods and Models
MethodsGaussian Process
