A Series of Unfortunate Counterfactual Events: the Role of Time in Counterfactual Explanations
Andrea Ferrario, Michele Loi

TL;DR
This paper highlights the importance of considering time dependency in counterfactual explanations, revealing how model retraining can lead to 'unfortunate events' that undermine trust and proposing solutions to address this issue.
Contribution
It introduces the concept of 'unfortunate counterfactual events' caused by model updates and offers an approach using explanation histories to mitigate this problem.
Findings
Unfortunate counterfactual events can occur due to model retraining.
These events can undermine trust in AI decision-making.
Proposed methods help maintain trustworthiness in credit lending contexts.
Abstract
Counterfactual explanations are a prominent example of post-hoc interpretability methods in the explainable Artificial Intelligence research domain. They provide individuals with alternative scenarios and a set of recommendations to achieve a sought-after machine learning model outcome. Recently, the literature has identified desiderata of counterfactual explanations, such as feasibility, actionability and sparsity that should support their applicability in real-world contexts. However, we show that the literature has neglected the problem of the time dependency of counterfactual explanations. We argue that, due to their time dependency and because of the provision of recommendations, even feasible, actionable and sparse counterfactual explanations may not be appropriate in real-world applications. This is due to the possible emergence of what we call "unfortunate counterfactual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare
MethodsInterpretability
