No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions

Emily Schiller; Teodor Chiaburu; Marco Zullich; Luca Longo

arXiv:2603.24524·cs.LG·March 26, 2026

No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions

Emily Schiller, Teodor Chiaburu, Marco Zullich, Luca Longo

PDF

Open Access

TL;DR

This paper introduces a comprehensive multi-dimensional evaluation framework for uncertainty attributions in explainable AI, addressing inconsistencies in current assessment methods and providing a systematic way to compare different approaches.

Contribution

It aligns uncertainty attribution evaluation with the Co-12 framework, introduces new properties like conveyance, and demonstrates the framework's effectiveness through extensive experiments.

Findings

01

Gradient-based methods outperform perturbation-based in consistency and conveyance.

02

Monte-Carlo Dropconnect generally outperforms Monte-Carlo Dropout.

03

Most metrics show low inter-method agreement, indicating no single metric suffices.

Abstract

Research on explainable AI (XAI) has frequently focused on explaining model predictions. More recently, methods have been proposed to explain prediction uncertainty by attributing it to input features (uncertainty attributions). However, the evaluation of these methods remains inconsistent as studies rely on heterogeneous proxy tasks and metrics, hindering comparability. We address this by aligning uncertainty attributions with the well-established Co-12 framework for XAI evaluation. We propose concrete implementations for the correctness, consistency, continuity, and compactness properties. Additionally, we introduce conveyance, a property tailored to uncertainty attributions that evaluates whether controlled increases in epistemic uncertainty reliably propagate to feature-level attributions. We demonstrate our evaluation framework with eight metrics across combinations of uncertainty…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis