Evaluating Explainability: A Framework for Systematic Assessment and Reporting of Explainable AI Features
Miguel A. Lago, Ghada Zamzmi, Brandon Eich, Jana G. Delfino

TL;DR
This paper introduces a comprehensive framework for systematically evaluating and reporting the quality of explainability features in AI, focusing on consistency, plausibility, fidelity, and usefulness, with a case study on medical imaging explanations.
Contribution
It proposes a novel evaluation framework and scorecard for explainable AI, addressing the lack of standardized assessment techniques and providing concrete criteria for explanation quality.
Findings
Framework successfully applied to medical imaging explanations
Evaluation criteria effectively distinguish explanation quality
Case study demonstrates practical utility of the framework
Abstract
Explainability features are intended to provide insight into the internal mechanisms of an AI device, but there is a lack of evaluation techniques for assessing the quality of provided explanations. We propose a framework to assess and report explainable AI features. Our evaluation framework for AI explainability is based on four criteria: 1) Consistency quantifies the variability of explanations to similar inputs, 2) Plausibility estimates how close the explanation is to the ground truth, 3) Fidelity assesses the alignment between the explanation and the model internal mechanisms, and 4) Usefulness evaluates the impact on task performance of the explanation. Finally, we developed a scorecard for AI explainability methods that serves as a complete description and evaluation to accompany this type of algorithm. We describe these four criteria and give examples on how they can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
MethodsClass-activation map
