Causality-based Counterfactual Explanation for Classification Models

Tri Dung Duong; Qian Li; Guandong Xu

arXiv:2105.00703·cs.LG·March 28, 2023·1 cites

Causality-based Counterfactual Explanation for Classification Models

Tri Dung Duong, Qian Li, Guandong Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces ProCE, a causal, gradient-free counterfactual explanation framework for classification models that effectively handles mixed feature types and preserves feature relationships.

Contribution

ProCE is the first framework to incorporate causal relationships in counterfactual explanations using a multi-objective genetic algorithm for mixed data types.

Findings

01

ProCE outperforms existing methods in generating realistic counterfactuals.

02

It effectively preserves causal feature relationships.

03

The approach is applicable to various prediction models.

Abstract

Counterfactual explanation is one branch of interpretable machine learning that produces a perturbation sample to change the model's original decision. The generated samples can act as a recommendation for end-users to achieve their desired outputs. Most of the current counterfactual explanation approaches are the gradient-based method, which can only optimize the differentiable loss functions with continuous variables. Accordingly, the gradient-free methods are proposed to handle the categorical variables, which however have several major limitations: 1) causal relationships among features are typically ignored when generating the counterfactuals, possibly resulting in impractical guidelines for decision-makers; 2) the counterfactual explanation algorithm requires a great deal of effort into parameter tuning for dertermining the optimal weight for each loss functions which must be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tridungduong16/multiobj-scm-cf
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Materials Science · Machine Learning and Data Classification