Evaluate with the Inverse: Efficient Approximation of Latent Explanation   Quality Distribution

Carlos Eiras-Franco; Anna Hedstr\"om; Marina M.-C. H\"ohne

arXiv:2502.15403·cs.LG·April 23, 2025

Evaluate with the Inverse: Efficient Approximation of Latent Explanation Quality Distribution

Carlos Eiras-Franco, Anna Hedstr\"om, Marina M.-C. H\"ohne

PDF

1 Video

TL;DR

This paper introduces the Quality Gap Estimate (QGE), a novel method for evaluating explanation quality by comparing explanations to their inverse, improving reliability and insight over traditional random comparison methods.

Contribution

The paper presents QGE, a new approach for explanation evaluation that directly compares explanations to their inverse, enhancing assessment accuracy and reliability.

Findings

01

QGE outperforms traditional random comparison methods.

02

QGE improves statistical reliability of explanation quality measures.

03

QGE provides more insightful evaluation of model explanations.

Abstract

Obtaining high-quality explanations of a model's output enables developers to identify and correct biases, align the system's behavior with human values, and ensure ethical compliance. Explainable Artificial Intelligence (XAI) practitioners rely on specific measures to gauge the quality of such explanations. These measures assess key attributes, such as how closely an explanation aligns with a model's decision process (faithfulness), how accurately it pinpoints the relevant input features (localization), and its consistency across different cases (robustness). Despite providing valuable information, these measures do not fully address a critical practitioner's concern: how does the quality of a given explanation compare to other potential explanations? Traditionally, the quality of an explanation has been assessed by comparing it to a randomly generated counterpart. This paper…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Evaluate with the Inverse: Efficient Approximation of Latent Explanation Quality Distribution· underline