No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions
Emily Schiller, Teodor Chiaburu, Marco Zullich, Luca Longo

TL;DR
This paper introduces a comprehensive multi-dimensional evaluation framework for uncertainty attributions in explainable AI, addressing inconsistencies in current assessment methods and providing a systematic way to compare different approaches.
Contribution
It aligns uncertainty attribution evaluation with the Co-12 framework, introduces new properties like conveyance, and demonstrates the framework's effectiveness through extensive experiments.
Findings
Gradient-based methods outperform perturbation-based in consistency and conveyance.
Monte-Carlo Dropconnect generally outperforms Monte-Carlo Dropout.
Most metrics show low inter-method agreement, indicating no single metric suffices.
Abstract
Research on explainable AI (XAI) has frequently focused on explaining model predictions. More recently, methods have been proposed to explain prediction uncertainty by attributing it to input features (uncertainty attributions). However, the evaluation of these methods remains inconsistent as studies rely on heterogeneous proxy tasks and metrics, hindering comparability. We address this by aligning uncertainty attributions with the well-established Co-12 framework for XAI evaluation. We propose concrete implementations for the correctness, consistency, continuity, and compactness properties. Additionally, we introduce conveyance, a property tailored to uncertainty attributions that evaluates whether controlled increases in epistemic uncertainty reliably propagate to feature-level attributions. We demonstrate our evaluation framework with eight metrics across combinations of uncertainty…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis
