TL;DR
This study compares the effectiveness of closest versus plausible counterfactual explanations in an abstract setting, finding that users benefit more from closest CFEs despite their lower computational plausibility, highlighting the importance of human-centered design in XAI.
Contribution
The paper provides empirical evidence that closest counterfactual explanations outperform plausible ones in user benefit, emphasizing the need to consider human psychology in XAI design.
Findings
Users benefit more from closest CFEs than plausible CFEs.
No difference in subjective user experience between the two types.
Closest CFEs are perceived as more psychologically plausible.
Abstract
Counterfactual explanations (CFEs) highlight what changes to a model's input would have changed its prediction in a particular way. CFEs have gained considerable traction as a psychologically grounded solution for explainable artificial intelligence (XAI). Recent innovations introduce the notion of computational plausibility for automatically generated CFEs, enhancing their robustness by exclusively creating plausible explanations. However, practical benefits of such a constraint on user experience and behavior is yet unclear. In this study, we evaluate objective and subjective usability of computationally plausible CFEs in an iterative learning design targeting novice users. We rely on a novel, game-like experimental design, revolving around an abstract scenario. Our results show that novice users actually benefit less from receiving computationally plausible rather than closest CFEs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
