Rigorous Probabilistic Guarantees for Robust Counterfactual Explanations

Luca Marzari; Francesco Leofante; Ferdinando Cicalese; Alessandro Farinelli

arXiv:2407.07482·cs.LG·July 11, 2024

TL;DR

This paper introduces a scalable probabilistic framework to assess the robustness of counterfactual explanations against model shifts in deep learning, providing tight guarantees and broad applicability across architectures.

Contribution

It presents the first NP-completeness proof for robustness computation under plausible model shifts and offers a novel probabilistic method that is scalable and architecture-agnostic.

Findings

01 · Outperforms existing methods on multiple datasets

02 · Provides tight robustness estimates with strong guarantees

03 · Enables robustness analysis on diverse neural network architectures

Abstract

We study the problem of assessing the robustness of counterfactual explanations for deep learning models. We focus on plausible model shifts altering model parameters and propose a novel framework to reason about the robustness property in this setting. To motivate our solution, we begin by showing for the first time that computing the robustness of counterfactuals with respect to plausible model shifts is NP-complete. As this (practically) rules out the existence of scalable algorithms for exactly computing robustness, we propose a novel probabilistic approach which is able to provide tight estimates of robustness with strong guarantees while preserving scalability. Remarkably, and differently from existing solutions targeting plausible model shifts, our approach does not impose requirements on the network to be analyzed, thus enabling robustness analysis on a wider range of…
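To make the idea of a sampling-based robustness estimate concrete, here is a minimal sketch. It is not the paper's algorithm: it uses plain Monte Carlo sampling with a one-sided Hoeffding bound, a toy linear classifier, and the assumption that a "plausible model shift" means an independent uniform perturbation of each parameter within `[-delta, delta]`. All function names and parameters are hypothetical and chosen for illustration only.

```python
import math
import random


def predict(weights, bias, x):
    """Toy linear classifier (assumption): class 1 if w.x + b >= 0, else 0."""
    score = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if score >= 0 else 0


def estimate_robustness(weights, bias, counterfactual, delta, n_samples, seed=0):
    """Fraction of sampled shifted models that keep the counterfactual's class.

    Assumption: each parameter is shifted independently and uniformly
    within [-delta, delta]; this is an illustrative shift model only.
    """
    rng = random.Random(seed)
    target = predict(weights, bias, counterfactual)
    valid = 0
    for _ in range(n_samples):
        shifted_w = [w + rng.uniform(-delta, delta) for w in weights]
        shifted_b = bias + rng.uniform(-delta, delta)
        if predict(shifted_w, shifted_b, counterfactual) == target:
            valid += 1
    return valid / n_samples


def hoeffding_lower_bound(estimate, n_samples, confidence):
    """One-sided Hoeffding bound: with probability >= confidence, the true
    validity rate under the sampling distribution is >= estimate - eps."""
    eps = math.sqrt(math.log(1.0 / (1.0 - confidence)) / (2.0 * n_samples))
    return max(0.0, estimate - eps)


if __name__ == "__main__":
    w, b = [2.0, -1.0], 0.5
    far_cf = [3.0, 1.0]    # far from the decision boundary
    near_cf = [0.0, 0.5]   # exactly on the decision boundary
    for name, cf in [("far", far_cf), ("near", near_cf)]:
        rate = estimate_robustness(w, b, cf, delta=0.1, n_samples=1000)
        bound = hoeffding_lower_bound(rate, 1000, confidence=0.99)
        print(name, rate, round(bound, 3))
```

The guarantee here is probabilistic and holds only with respect to the assumed sampling distribution over shifts; it says nothing about worst-case shifts, which is the quantity the abstract identifies as NP-complete to compute exactly.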

Code & Models

Repositories

lmarza/apas · tf · Official

Taxonomy

Topics: Explainable Artificial Intelligence (XAI)

Methods: Focus · Counterfactual Explanations