Individualised Counterfactual Examples Using Conformal Prediction Intervals
James M. Adams, Gesine Reinert, Lukasz Szpruch, Carsten Maple, Andrew Elliott

TL;DR
This paper introduces a method for generating individualized counterfactual explanations for black-box classifiers by leveraging conformal prediction intervals to identify the most informative counterfactuals based on the individual's knowledge and prediction uncertainty.
Contribution
It proposes a novel approach combining conformal prediction intervals with counterfactual explanations to personalize and improve the informativeness of counterfactuals for decision understanding.
Findings
Counterfactuals in regions of high uncertainty are more informative.
Synthetic and real datasets demonstrate the effectiveness of CPICFs.
Counterfactuals improve model understanding and data augmentation.
Abstract
Counterfactual explanations for black-box models aim to pr ovide insight into an algorithmic decision to its recipient. For a binary classification problem an individual counterfactual details which features might be changed for the model to infer the opposite class. High-dimensional feature spaces that are typical of machine learning classification models admit many possible counterfactual examples to a decision, and so it is important to identify additional criteria to select the most useful counterfactuals. In this paper, we explore the idea that the counterfactuals should be maximally informative when considering the knowledge of a specific individual about the underlying classifier. To quantify this information gain we explicitly model the knowledge of the individual, and assess the uncertainty of predictions which the individual makes by the width of a conformal prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Imbalanced Data Classification Techniques · Adversarial Robustness in Machine Learning
MethodsCounterfactuals Explanations · Sparse Evolutionary Training
