How Well Do Feature-Additive Explainers Explain Feature-Additive   Predictors?

Zachariah Carmichael; Walter J. Scheirer

arXiv:2310.18496·cs.LG·October 31, 2023·1 cites

How Well Do Feature-Additive Explainers Explain Feature-Additive Predictors?

Zachariah Carmichael, Walter J. Scheirer

PDF

Open Access 1 Repo

TL;DR

This paper evaluates the fidelity of popular feature-additive explainers like LIME and SHAP in explaining feature-additive models, revealing their limitations especially with feature interactions across various models and datasets.

Contribution

It provides a systematic evaluation of feature-additive explainers against ground truth derived from model structures, highlighting their shortcomings in real-world scenarios.

Findings

01

Explainability methods often fail to accurately attribute feature importance.

02

Performance degrades with complex feature interactions.

03

All tested explainers show limitations in real-world tasks.

Abstract

Surging interest in deep learning from high-stakes domains has precipitated concern over the inscrutable nature of black box neural networks. Explainable AI (XAI) research has led to an abundance of explanation algorithms for these black boxes. Such post hoc explainers produce human-comprehensible explanations, however, their fidelity with respect to the model is not well understood - explanation evaluation remains one of the most challenging issues in XAI. In this paper, we ask a targeted but important question: can popular feature-additive explainers (e.g., LIME, SHAP, SHAPR, MAPLE, and PDP) explain feature-additive predictors? Herein, we evaluate such explainers on ground truth that is analytically derived from the additive structure of a model. We demonstrate the efficacy of our approach in understanding these explainers applied to symbolic expressions, neural networks, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

craymichael/PostHocExplainerEvaluation
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Adversarial Robustness in Machine Learning

MethodsLocal Interpretable Model-Agnostic Explanations · High-Order Consensuses · Shapley Additive Explanations