Preserving Causal Constraints in Counterfactual Explanations for Machine Learning Classifiers

Divyat Mahajan; Chenhao Tan; Amit Sharma

arXiv:1912.03277 · cs.LG · June 16, 2020 · 101 cites

TL;DR

This paper develops methods to generate counterfactual explanations for machine learning models that respect causal relationships and real-world feasibility, improving interpretability in critical domains like healthcare and finance.

Contribution

It introduces a causal-aware framework and a feasibility-labeled learning approach for generating more realistic counterfactual explanations.

Findings

01 Generated counterfactuals better satisfy feasibility constraints

02 Proposed methods outperform existing approaches

03 Effective on Bayesian networks and the Adult-Income dataset

Abstract

To construct interpretable explanations that are consistent with the original ML model, counterfactual examples, which show how the model's output changes with small perturbations to the input, have been proposed. This paper extends the work in counterfactual explanations by addressing the challenge of feasibility of such examples. For explanations of ML models in critical domains such as healthcare and finance, counterfactual examples are useful for an end-user only to the extent that perturbation of feature inputs is feasible in the real world. We formulate the problem of feasibility as preserving causal relationships among input features and present a method that uses (partial) structural causal models to generate actionable counterfactuals. When feasibility constraints cannot be easily expressed, we consider an alternative mechanism where people can label generated CF examples on…

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Machine Learning in Healthcare