Don't Explain Noise: Robust Counterfactuals for Randomized Ensembles

Alexandre Forel; Axel Parmentier; Thibaut Vidal

arXiv:2205.14116 · cs.LG · March 22, 2024 · 1 citation

TL;DR

This paper introduces a method for generating robust counterfactual explanations for randomized ensemble models, ensuring higher validity and stability of explanations with minimal additional distance.

Contribution

It formalizes the problem of robust counterfactuals for ensembles, links ensemble robustness to base learner robustness, and provides a practical method with theoretical guarantees.

Findings

01 Naive counterfactuals produced by existing methods have validity below 50% on most data sets.

02 Robust counterfactuals achieve higher validity, up to 80-90%.

03 The proposed method keeps the increase in distance from the initial observation low.

Abstract

Counterfactual explanations describe how to modify a feature vector in order to flip the outcome of a trained classifier. Obtaining robust counterfactual explanations is essential to provide valid algorithmic recourse and meaningful explanations. We study the robustness of explanations of randomized ensembles, which are always subject to algorithmic uncertainty even when the training data is fixed. We formalize the generation of robust counterfactual explanations as a probabilistic problem and show the link between the robustness of ensemble models and the robustness of base learners. We develop a practical method with good empirical performance and support it with theoretical guarantees for ensembles of convex base learners. Our results show that existing methods give surprisingly low robustness: the validity of naive counterfactuals is below $50\%$ on most data sets and can fall to…
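To make the notion of validity under algorithmic uncertainty concrete, here is a minimal sketch in plain Python. It is not the authors' method or code: the 1-D toy data, the bagged decision stumps, the grid search, and the `min_share` vote threshold of 0.9 are all assumptions chosen for illustration. The sketch retrains a randomized ensemble on the same fixed data with different seeds and measures how often a counterfactual found on one ensemble is still classified as the target class by the others.

```python
import random

def make_data(n=60, noise=0.1, seed=0):
    """Toy 1-D data (assumed): label is 1 when x > 0.5, flipped with prob. `noise`."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        x = rng.random()
        y = int(x > 0.5)
        if rng.random() < noise:
            y = 1 - y
        data.append((x, y))
    return data

def fit_stump(sample):
    """Return the threshold t minimizing errors of the rule 'predict 1 iff x > t'."""
    xs = sorted(x for x, _ in sample)
    candidates = [(a + b) / 2 for a, b in zip(xs, xs[1:])]
    def errors(t):
        return sum(int(x > t) != y for x, y in sample)
    return min(candidates, key=errors)

def fit_ensemble(data, n_stumps, seed):
    """Bagging: each stump is fit on a bootstrap resample; `seed` is the only randomness."""
    rng = random.Random(seed)
    return [fit_stump(rng.choices(data, k=len(data))) for _ in range(n_stumps)]

def vote_share(ensemble, x):
    """Fraction of stumps voting for class 1 at x."""
    return sum(x > t for t in ensemble) / len(ensemble)

def predict(ensemble, x):
    """Majority vote of the ensemble."""
    return int(vote_share(ensemble, x) > 0.5)

def counterfactual(ensemble, x0, min_share, step=1e-3):
    """Smallest grid increase of x0 at which at least `min_share` of stumps vote 1."""
    x = x0
    while vote_share(ensemble, x) < min_share:
        x += step
    return x

def validity(x_cf, data, n_stumps, seeds, target=1):
    """Share of independently retrained ensembles that predict `target` at x_cf."""
    hits = sum(predict(fit_ensemble(data, n_stumps, s), x_cf) == target for s in seeds)
    return hits / len(seeds)

data = make_data()
reference = fit_ensemble(data, n_stumps=15, seed=0)

# Naive: stop as soon as a bare majority of the reference ensemble flips.
x_naive = counterfactual(reference, x0=0.2, min_share=0.51)
# Hypothetical robust variant: demand a larger share of base-learner votes.
x_robust = counterfactual(reference, x0=0.2, min_share=0.9)

seeds = range(1, 21)  # retraining seeds, none equal to the reference seed
v_naive = validity(x_naive, data, 15, seeds)
v_robust = validity(x_robust, data, 15, seeds)
print(f"naive:  x={x_naive:.3f}, validity={v_naive:.2f}")
print(f"robust: x={x_robust:.3f}, validity={v_robust:.2f}")
```

Because every stump in this toy setup votes monotonically in `x`, the stricter vote threshold can only move the counterfactual further from `x0` and can only raise its validity. That mirrors, in a very simplified form, the trade-off between robustness and distance described in the abstract.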

Code & Models

Repositories

alexforel/robustcf4rf · Official

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification

Methods: FLIP · Counterfactual Explanations · Balanced Selection