How can I choose an explainer? An Application-grounded Evaluation of Post-hoc Explanations
S\'ergio Jesus, Catarina Bel\'em, Vladimir Balayan, Jo\~ao Bento,, Pedro Saleiro, Pedro Bizarro, Jo\~ao Gama

TL;DR
This paper introduces XAI Test, an application-grounded evaluation methodology, and uses it to assess the real-world impact of popular post-hoc explanation methods on fraud detection decision-making.
Contribution
It provides a practical evaluation framework for explanations and compares the effectiveness of LIME, SHAP, and TreeInterpreter in a real-world setting.
Findings
Data Only yields highest accuracy and slowest decisions.
Explanations improve accuracy over model scores alone.
LIME is least preferred due to explanation variability.
Abstract
There have been several research works proposing new Explainable AI (XAI) methods designed to generate model explanations having specific properties, or desiderata, such as fidelity, robustness, or human-interpretability. However, explanations are seldom evaluated based on their true practical impact on decision-making tasks. Without that assessment, explanations might be chosen that, in fact, hurt the overall performance of the combined system of ML model + end-users. This study aims to bridge this gap by proposing XAI Test, an application-grounded evaluation methodology tailored to isolate the impact of providing the end-user with different levels of information. We conducted an experiment following XAI Test to evaluate three popular post-hoc explanation methods -- LIME, SHAP, and TreeInterpreter -- on a real-world fraud detection task, with real data, a deployed ML model, and fraud…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Imbalanced Data Classification Techniques · Artificial Intelligence in Healthcare and Education
MethodsLocal Interpretable Model-Agnostic Explanations · Shapley Additive Explanations
