Model-Agnostic Counterfactual Explanations for Consequential Decisions
Amir-Hossein Karimi, Gilles Barthe, Borja Balle, Isabel Valera

TL;DR
This paper introduces a novel, model-agnostic algorithm for generating counterfactual explanations for individual decisions, capable of handling diverse models, data types, and distance metrics with provable optimality.
Contribution
It presents a formal verification-based, satisfiability solving approach that overcomes limitations of existing methods restricted to specific models or differentiable distances.
Findings
Handles non-linear, non-differentiable, non-convex models
Supports heterogeneous data types and multiple distance metrics
Generates diverse, plausible counterfactuals with optimal distances
Abstract
Predictive models are being increasingly used to support consequential decision making at the individual level in contexts such as pretrial bail and loan approval. As a result, there is increasing social and legal pressure to provide explanations that help the affected individuals not only to understand why a prediction was output, but also how to act to obtain a desired outcome. To this end, several works have proposed optimization-based methods to generate nearest counterfactual explanations. However, these methods are often restricted to a particular subset of models (e.g., decision trees or linear models) and differentiable distance functions. In contrast, we build on standard theory and tools from formal verification and propose a novel algorithm that solves a sequence of satisfiability problems, where both the distance function (objective) and predictive model (constraints) are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Adversarial Robustness in Machine Learning
MethodsCounterfactuals Explanations
