Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach

Carlos Fernández-Loría; Foster Provost; Xintian Han

arXiv:2001.07417 · cs.LG · October 14, 2021 · 54 citations

Open Access

TL;DR

This paper examines counterfactual explanations for AI decisions, defining an explanation as an irreducible set of inputs that causally drives the decision. It compares this approach with importance-weight methods such as SHAP and uses examples and case studies to show where counterfactual explanations have the advantage.

Contribution

The paper shows how a general counterfactual framework can explain decisions made by data-driven AI systems, proposes a heuristic procedure for finding the most useful explanations in a given context, and highlights limitations of importance-weight methods.

Findings

01 Counterfactual explanations better capture causal decision factors.

02 Importance weights may misrepresent feature influence on decisions.

03 Case studies demonstrate advantages of counterfactual explanations over SHAP.

Abstract

We examine counterfactual explanations for explaining the decisions made by model-based AI systems. The counterfactual approach we consider defines an explanation as a set of the system's data inputs that causally drives the decision (i.e., changing the inputs in the set changes the decision) and is irreducible (i.e., changing any proper subset of the inputs does not change the decision). We (1) demonstrate how this framework may be used to provide explanations for decisions made by general, data-driven AI systems that may incorporate features with arbitrary data types and multiple predictive models, and (2) propose a heuristic procedure to find the most useful explanations depending on the context. We then contrast counterfactual explanations with methods that explain model predictions by weighting features according to their importance (e.g., SHAP, LIME) and present two fundamental reasons…
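
The definition in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration: the decision rule `decide`, the feature names, and the idea of "changing" an input by replacing it with a value from a hypothetical `baseline` instance. The search is a plain exhaustive enumeration by subset size, not the heuristic procedure the paper proposes. It returns every set of inputs whose change flips the decision and that contains no smaller set doing the same.

```python
from itertools import combinations
from typing import Callable, Dict, List, Tuple

Instance = Dict[str, float]


def decide(x: Instance) -> bool:
    """Hypothetical decision rule (an assumption, not from the paper):
    approve when a linear score reaches a threshold."""
    score = 2.0 * x["income"] + 1.0 * x["tenure"] - 2.0 * x["debt"]
    return score >= 3.5


def counterfactual_explanations(
    x: Instance,
    baseline: Instance,
    decide_fn: Callable[[Instance], bool],
) -> List[Tuple[str, ...]]:
    """Return all irreducible feature sets whose change flips the decision.

    "Changing" a feature is modeled (as an assumption) by replacing its
    value with the corresponding value in `baseline`.
    """
    original = decide_fn(x)
    features = sorted(x)
    found: List[Tuple[str, ...]] = []
    # Enumerate subsets from smallest to largest, so any set that contains
    # an already-found explanation is reducible and can be skipped.
    for size in range(1, len(features) + 1):
        for subset in combinations(features, size):
            if any(set(e) <= set(subset) for e in found):
                continue
            changed = dict(x)
            for name in subset:
                changed[name] = baseline[name]
            if decide_fn(changed) != original:
                found.append(subset)
    return found


# Hypothetical applicant and baseline values, chosen only for illustration.
applicant = {"income": 3.0, "tenure": 2.0, "debt": 1.0}
baseline = {"income": 1.0, "tenure": 0.0, "debt": 2.0}

explanations = counterfactual_explanations(applicant, baseline, decide)
print(explanations)  # [('income',), ('debt', 'tenure')]
```

In this toy case the decision has two explanations: changing `income` alone flips it, and so does changing `debt` and `tenure` together, while neither `debt` nor `tenure` flips it alone. Exhaustive enumeration grows exponentially with the number of features, which is one reason a heuristic search is attractive for realistic systems.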

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Machine Learning and Data Classification

Methods: Shapley Additive Explanations