When concept-based XAI is imprecise: Do people distinguish between generalisations and misrepresentations?
Romy M\"uller

TL;DR
This study examines whether people can distinguish between accurate generalizations and imprecise or misleading concept-based explanations in AI, revealing that users are more sensitive to inaccuracies in relevant features than to overgeneralizations.
Contribution
It provides empirical evidence that users struggle to recognize valid generalizations in concept-based explanations and are more affected by inaccuracies in relevant features.
Findings
Participants rated relevant feature inaccuracies as more problematic.
Overgeneralizations over less relevant features were rated lower than precise matches.
Users are less sensitive to imprecise explanations involving irrelevant features.
Abstract
Concept-based explainable artificial intelligence (C-XAI) can let people see which representations an AI model has learned. This is particularly important when high-level semantic information (e.g., actions and relations) is used to make decisions about abstract categories (e.g., danger). In such tasks, AI models need to generalise beyond situation-specific details, and this ability can be reflected in C-XAI outputs that randomise over irrelevant features. However, it is unclear whether people appreciate such generalisation and can distinguish it from other, less desirable forms of imprecision in C-XAI outputs. Therefore, the present study investigated how the generality and relevance of C-XAI outputs affect people's evaluation of AI. In an experimental railway safety evaluation scenario, participants rated the performance of a simulated AI that classified traffic scenes involving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
