Can counterfactual explanations of AI systems' predictions skew lay   users' causal intuitions about the world? If so, can we correct for that?

Marko Tesic; Ulrike Hahn

arXiv:2205.06241·cs.AI·December 14, 2022·1 cites

Can counterfactual explanations of AI systems' predictions skew lay users' causal intuitions about the world? If so, can we correct for that?

Marko Tesic, Ulrike Hahn

PDF

Open Access 1 Repo

TL;DR

Counterfactual explanations in AI can unjustifiably influence people's causal beliefs, but explicitly clarifying that AI models capture correlations, not causality, can mitigate this effect.

Contribution

This study empirically investigates how counterfactual explanations affect lay causal beliefs and proposes a simple correction method to reduce misconceptions.

Findings

01

CF explanations influence causal beliefs about AI features

02

Clarifying AI's correlation-based nature reduces belief distortion

03

Experiment results support correction approach effectiveness

Abstract

Counterfactual (CF) explanations have been employed as one of the modes of explainability in explainable AI-both to increase the transparency of AI systems and to provide recourse. Cognitive science and psychology, however, have pointed out that people regularly use CFs to express causal relationships. Most AI systems are only able to capture associations or correlations in data so interpreting them as casual would not be justified. In this paper, we present two experiment (total N = 364) exploring the effects of CF explanations of AI system's predictions on lay people's causal beliefs about the real world. In Experiment 1 we found that providing CF explanations of an AI system's predictions does indeed (unjustifiably) affect people's causal beliefs regarding factors/features the AI uses and that people are more likely to view them as causal factors in the real world. Inspired by the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

markotesic/counterfactual_explanations_XAI
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Explainable Artificial Intelligence (XAI) · Decision-Making and Behavioral Economics