Remedying uncertainty representations in visual inference through Explaining-Away Variational Autoencoders
Josefina Catoni, Domonkos Martos, Ferenc Csikor, Enzo Ferrante, Diego H. Milone, Bal\'azs Mesz\'ena, Gerg\H{o} Orb\'an, Rodrigo Echeveste

TL;DR
This paper introduces the Explaining-Away VAE (EA-VAE), an extension to standard VAEs that improves uncertainty representation in computer vision tasks by incorporating a global scaling latent variable, inspired by classical vision and neural normalization.
Contribution
The paper proposes a simple yet effective modification to VAEs using a global scaling latent variable, enhancing uncertainty estimation across diverse vision datasets and conditions.
Findings
EA-VAE restores normative uncertainty requirements in vision tasks.
The introduced scaling latent improves contrast and out-of-distribution uncertainty handling.
EA-VAE employs divisive normalization, akin to biological neural networks, to enhance inference.
Abstract
Optimal computations under uncertainty require an adequate probabilistic representation about beliefs. Deep generative models, and specifically Variational Autoencoders (VAEs), have the potential to meet this demand by building latent representations that learn to associate uncertainties with inferences while avoiding their characteristic intractable computations. Yet, we show that it is precisely uncertainty representation that suffers from inconsistencies under an array of relevant computer vision conditions: contrast-dependent computations, image corruption, out-of-distribution detection. Drawing inspiration from classical computer vision, we present a principled extension to the standard VAE by introducing a simple yet powerful inductive bias through a global scaling latent variable, which we call the Explaining-Away VAE (EA-VAE). By applying EA-VAEs to a spectrum of computer vision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
