Structured Uncertainty in the Observation Space of Variational Autoencoders
James Langley, Miguel Monteiro, Charles Jones, Nick Pawlowski, Ben, Glocker

TL;DR
This paper introduces SOS-VAE, a novel observational distribution for variational autoencoders that models spatial dependencies, leading to more coherent and semantically meaningful image samples.
Contribution
It proposes a low-rank parameterization of the observational distribution in VAEs, capturing pixel covariance for improved image synthesis.
Findings
Samples exhibit spatial coherence and semantic variation.
The model captures relevant pixel covariance.
Produces multiple plausible outputs from a single pass.
Abstract
Variational autoencoders (VAEs) are a popular class of deep generative models with many variants and a wide range of applications. Improvements upon the standard VAE mostly focus on the modelling of the posterior distribution over the latent space and the properties of the neural network decoder. In contrast, improving the model for the observational distribution is rarely considered and typically defaults to a pixel-wise independent categorical or normal distribution. In image synthesis, sampling from such distributions produces spatially-incoherent results with uncorrelated pixel noise, resulting in only the sample mean being somewhat useful as an output prediction. In this paper, we aim to stay true to VAE theory by improving the samples from the observational distribution. We propose SOS-VAE, an alternative model for the observation space, encoding spatial dependencies via a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques
