Preserving Causal Constraints in Counterfactual Explanations for Machine Learning Classifiers
Divyat Mahajan, Chenhao Tan, Amit Sharma

TL;DR
This paper develops methods to generate counterfactual explanations for machine learning models that respect causal relationships and real-world feasibility, improving interpretability in critical domains like healthcare and finance.
Contribution
It introduces a causal-aware framework and a feasibility-labeled learning approach for generating more realistic counterfactual explanations.
Findings
Generated counterfactuals better satisfy feasibility constraints
Proposed methods outperform existing approaches
Effective on Bayesian networks and the Adult-Income dataset
Abstract
To construct interpretable explanations that are consistent with the original ML model, counterfactual examples---showing how the model's output changes with small perturbations to the input---have been proposed. This paper extends the work in counterfactual explanations by addressing the challenge of feasibility of such examples. For explanations of ML models in critical domains such as healthcare and finance, counterfactual examples are useful for an end-user only to the extent that perturbation of feature inputs is feasible in the real world. We formulate the problem of feasibility as preserving causal relationships among input features and present a method that uses (partial) structural causal models to generate actionable counterfactuals. When feasibility constraints cannot be easily expressed, we consider an alternative mechanism where people can label generated CF examples on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Machine Learning in Healthcare
