Do Explanations Reflect Decisions? A Machine-centric Strategy to Quantify the Performance of Explainability Algorithms
Zhong Qiu Lin, Mohammad Javad Shafiee, Stanislav Bochkarev, Michael, St. Jules, Xiao Yu Wang, and Alexander Wong

TL;DR
This paper introduces a machine-centric approach with quantitative metrics to evaluate the performance of explainability algorithms in deep neural networks, moving beyond subjective assessments.
Contribution
It proposes two novel metrics, Impact Score and Impact Coverage, for objectively measuring explainability methods' effectiveness on neural network decisions.
Findings
GSInquire identified the most impactful regions (~76%)
Impact scores varied across methods, with LIME at ~38% and SHAP at ~44%
The approach offers a more objective evaluation of explainability algorithms
Abstract
There has been a significant surge of interest recently around the concept of explainable artificial intelligence (XAI), where the goal is to produce an interpretation for a decision made by a machine learning algorithm. Of particular interest is the interpretation of how deep neural networks make decisions, given the complexity and `black box' nature of such networks. Given the infancy of the field, there has been very limited exploration into the assessment of the performance of explainability methods, with most evaluations centered around subjective visual interpretation of the produced interpretations. In this study, we explore a more machine-centric strategy for quantifying the performance of explainability methods on deep neural networks via the notion of decision-making impact analysis. We introduce two quantitative performance metrics: i) Impact Score, which assesses the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education
MethodsShapley Additive Explanations · Local Interpretable Model-Agnostic Explanations
