The Effect of Similarity Measures on Accurate Stability Estimates for Local Surrogate Models in Text-based Explainable AI
Christopher Burger, Charles Walter, Thai Le

TL;DR
This paper examines how different similarity measures impact the assessment of local surrogate models' stability in text-based explainable AI, revealing that some measures can misrepresent vulnerability to adversarial attacks.
Contribution
It systematically evaluates various similarity measures for text explanations, highlighting their influence on stability estimates and identifying potential pitfalls in adversarial robustness assessments.
Findings
Some similarity measures are overly sensitive, exaggerating vulnerabilities.
Other measures are too coarse, underestimating weaknesses.
Choice of similarity measure significantly affects stability conclusions.
Abstract
Recent work has investigated the vulnerability of local surrogate methods to adversarial perturbations on a machine learning (ML) model's inputs, where the explanation is manipulated while the meaning and structure of the original input remains similar under the complex model. Although weaknesses across many methods have been shown to exist, the reasons behind why remain little explored. Central to the concept of adversarial attacks on explainable AI (XAI) is the similarity measure used to calculate how one explanation differs from another. A poor choice of similarity measure can lead to erroneous conclusions on the efficacy of an XAI method. Too sensitive a measure results in exaggerated vulnerability, while too coarse understates its weakness. We investigate a variety of similarity measures designed for text-based ranked lists, including Kendall's Tau, Spearman's Footrule, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
