TL;DR
This paper introduces the Quality Gap Estimate (QGE), a novel method for evaluating explanation quality by comparing explanations to their inverse, improving reliability and insight over traditional random comparison methods.
Contribution
The paper presents QGE, a new approach for explanation evaluation that directly compares explanations to their inverse, enhancing assessment accuracy and reliability.
Findings
QGE outperforms traditional random comparison methods.
QGE improves statistical reliability of explanation quality measures.
QGE provides more insightful evaluation of model explanations.
Abstract
Obtaining high-quality explanations of a model's output enables developers to identify and correct biases, align the system's behavior with human values, and ensure ethical compliance. Explainable Artificial Intelligence (XAI) practitioners rely on specific measures to gauge the quality of such explanations. These measures assess key attributes, such as how closely an explanation aligns with a model's decision process (faithfulness), how accurately it pinpoints the relevant input features (localization), and its consistency across different cases (robustness). Despite providing valuable information, these measures do not fully address a critical practitioner's concern: how does the quality of a given explanation compare to other potential explanations? Traditionally, the quality of an explanation has been assessed by comparing it to a randomly generated counterpart. This paper…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
