On the Trade-offs between Adversarial Robustness and Actionable   Explanations

Satyapriya Krishna; Chirag Agarwal; Himabindu Lakkaraju

arXiv:2309.16452·cs.LG·July 25, 2024

On the Trade-offs between Adversarial Robustness and Actionable Explanations

Satyapriya Krishna, Chirag Agarwal, Himabindu Lakkaraju

PDF

Open Access

TL;DR

This paper investigates the trade-offs between adversarial robustness and the quality of actionable explanations in machine learning models, revealing that increased robustness can raise recourse costs and lower validity.

Contribution

It provides the first theoretical and empirical analysis of how adversarial robustness affects the cost and validity of recourses in explainable models.

Findings

01

Adversarially robust models increase recourse costs.

02

Robust models decrease the validity of generated recourses.

03

Theoretical bounds quantify these trade-offs.

Abstract

As machine learning models are increasingly being employed in various high-stakes settings, it becomes important to ensure that predictions of these models are not only adversarially robust, but also readily explainable to relevant stakeholders. However, it is unclear if these two notions can be simultaneously achieved or if there exist trade-offs between them. In this work, we make one of the first attempts at studying the impact of adversarially robust models on actionable explanations which provide end users with a means for recourse. We theoretically and empirically analyze the cost (ease of implementation) and validity (probability of obtaining a positive model prediction) of recourses output by state-of-the-art algorithms when the underlying models are adversarially robust vs. non-robust. More specifically, we derive theoretical bounds on the differences between the cost and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications