Analyzing Explainer Robustness via Probabilistic Lipschitzness of   Prediction Functions

Zulqarnain Khan; Davin Hill; Aria Masoomi; Joshua Bone; and Jennifer; Dy

arXiv:2206.12481·cs.LG·April 17, 2024

Analyzing Explainer Robustness via Probabilistic Lipschitzness of Prediction Functions

Zulqarnain Khan, Davin Hill, Aria Masoomi, Joshua Bone, and Jennifer, Dy

PDF

Open Access

TL;DR

This paper introduces a formal framework linking the robustness of explanation methods to the probabilistic Lipschitzness of prediction functions, providing theoretical guarantees and empirical validation for explanation stability.

Contribution

It formalizes explainer robustness through explainer astuteness and connects it to the predictor's probabilistic Lipschitzness, offering new theoretical bounds and empirical insights.

Findings

01

Locally smooth prediction functions lead to robust explanations.

02

Theoretical lower bounds on explainer astuteness based on Lipschitzness.

03

Empirical validation on simulated and real datasets supports the theory.

Abstract

Machine learning methods have significantly improved in their predictive capabilities, but at the same time they are becoming more complex and less transparent. As a result, explainers are often relied on to provide interpretability to these black-box prediction models. As crucial diagnostics tools, it is important that these explainers themselves are robust. In this paper we focus on one particular aspect of robustness, namely that an explainer should give similar explanations for similar data inputs. We formalize this notion by introducing and defining explainer astuteness, analogous to astuteness of prediction functions. Our formalism allows us to connect explainer robustness to the predictor's probabilistic Lipschitzness, which captures the probability of local smoothness of a function. We provide lower bound guarantees on the astuteness of a variety of explainers (e.g., SHAP, RISE,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification

MethodsShapley Additive Explanations