Can counterfactual explanations of AI systems' predictions skew lay users' causal intuitions about the world? If so, can we correct for that?
Marko Tesic, Ulrike Hahn

TL;DR
Counterfactual explanations in AI can unjustifiably influence people's causal beliefs, but explicitly clarifying that AI models capture correlations, not causality, can mitigate this effect.
Contribution
This study empirically investigates how counterfactual explanations affect lay causal beliefs and proposes a simple correction method to reduce misconceptions.
Findings
CF explanations influence causal beliefs about AI features
Clarifying AI's correlation-based nature reduces belief distortion
Experiment results support correction approach effectiveness
Abstract
Counterfactual (CF) explanations have been employed as one of the modes of explainability in explainable AI-both to increase the transparency of AI systems and to provide recourse. Cognitive science and psychology, however, have pointed out that people regularly use CFs to express causal relationships. Most AI systems are only able to capture associations or correlations in data so interpreting them as casual would not be justified. In this paper, we present two experiment (total N = 364) exploring the effects of CF explanations of AI system's predictions on lay people's causal beliefs about the real world. In Experiment 1 we found that providing CF explanations of an AI system's predictions does indeed (unjustifiably) affect people's causal beliefs regarding factors/features the AI uses and that people are more likely to view them as causal factors in the real world. Inspired by the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Explainable Artificial Intelligence (XAI) · Decision-Making and Behavioral Economics
