Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach
Carlos Fern\'andez-Lor\'ia, Foster Provost, Xintian Han

TL;DR
This paper explores counterfactual explanations for AI decisions, emphasizing causality and irreducibility, and compares them with importance-weight methods like SHAP, demonstrating their advantages through examples and case studies.
Contribution
It introduces a general counterfactual explanation framework for AI decisions and a heuristic for identifying the most relevant explanations, highlighting limitations of importance-weight methods.
Findings
Counterfactual explanations better capture causal decision factors.
Importance weights may misrepresent feature influence on decisions.
Case studies demonstrate advantages of counterfactual explanations over SHAP.
Abstract
We examine counterfactual explanations for explaining the decisions made by model-based AI systems. The counterfactual approach we consider defines an explanation as a set of the system's data inputs that causally drives the decision (i.e., changing the inputs in the set changes the decision) and is irreducible (i.e., changing any subset of the inputs does not change the decision). We (1) demonstrate how this framework may be used to provide explanations for decisions made by general, data-driven AI systems that may incorporate features with arbitrary data types and multiple predictive models, and (2) propose a heuristic procedure to find the most useful explanations depending on the context. We then contrast counterfactual explanations with methods that explain model predictions by weighting features according to their importance (e.g., SHAP, LIME) and present two fundamental reasons…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Machine Learning and Data Classification
MethodsShapley Additive Explanations
