TL;DR
This paper introduces Evidential Alignment, a new method that uses uncertainty quantification to reduce reliance on spurious correlations in neural networks, improving out-of-distribution robustness without needing group annotations.
Contribution
It proposes a novel evidential calibration framework that debiases models by understanding their biases through uncertainty, without external annotations or auxiliary models.
Findings
Significantly improves group robustness across architectures.
Effectively suppresses spurious correlations without group labels.
Applicable to diverse data modalities.
Abstract
Deep neural networks often learn and rely on spurious correlations, i.e., superficial associations between non-causal features and the targets. For instance, an image classifier may identify camels based on the desert backgrounds. While it can yield high overall accuracy during training, it degrades generalization on more diverse scenarios where such correlations do not hold. This problem poses significant challenges for out-of-distribution robustness and trustworthiness. Existing methods typically mitigate this issue by using external group annotations or auxiliary deterministic models to learn unbiased representations. However, such information is costly to obtain, and deterministic models may fail to capture the full spectrum of biases learned by the models. To address these limitations, we propose Evidential Alignment, a novel framework that leverages uncertainty quantification to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
