Remedying uncertainty representations in visual inference through Explaining-Away Variational Autoencoders

Josefina Catoni; Domonkos Martos; Ferenc Csikor; Enzo Ferrante; Diego H. Milone; Bal\'azs Mesz\'ena; Gerg\H{o} Orb\'an; Rodrigo Echeveste

arXiv:2404.15390·cs.LG·March 31, 2026·1 cites

Remedying uncertainty representations in visual inference through Explaining-Away Variational Autoencoders

Josefina Catoni, Domonkos Martos, Ferenc Csikor, Enzo Ferrante, Diego H. Milone, Bal\'azs Mesz\'ena, Gerg\H{o} Orb\'an, Rodrigo Echeveste

PDF

TL;DR

This paper introduces the Explaining-Away VAE (EA-VAE), an extension to standard VAEs that improves uncertainty representation in computer vision tasks by incorporating a global scaling latent variable, inspired by classical vision and neural normalization.

Contribution

The paper proposes a simple yet effective modification to VAEs using a global scaling latent variable, enhancing uncertainty estimation across diverse vision datasets and conditions.

Findings

01

EA-VAE restores normative uncertainty requirements in vision tasks.

02

The introduced scaling latent improves contrast and out-of-distribution uncertainty handling.

03

EA-VAE employs divisive normalization, akin to biological neural networks, to enhance inference.

Abstract

Optimal computations under uncertainty require an adequate probabilistic representation about beliefs. Deep generative models, and specifically Variational Autoencoders (VAEs), have the potential to meet this demand by building latent representations that learn to associate uncertainties with inferences while avoiding their characteristic intractable computations. Yet, we show that it is precisely uncertainty representation that suffers from inconsistencies under an array of relevant computer vision conditions: contrast-dependent computations, image corruption, out-of-distribution detection. Drawing inspiration from classical computer vision, we present a principled extension to the standard VAE by introducing a simple yet powerful inductive bias through a global scaling latent variable, which we call the Explaining-Away VAE (EA-VAE). By applying EA-VAEs to a spectrum of computer vision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.