When concept-based XAI is imprecise: Do people distinguish between generalisations and misrepresentations?

Romy M\"uller

arXiv:2506.17936·cs.HC·November 21, 2025

When concept-based XAI is imprecise: Do people distinguish between generalisations and misrepresentations?

Romy M\"uller

PDF

TL;DR

This study examines whether people can distinguish between accurate generalizations and imprecise or misleading concept-based explanations in AI, revealing that users are more sensitive to inaccuracies in relevant features than to overgeneralizations.

Contribution

It provides empirical evidence that users struggle to recognize valid generalizations in concept-based explanations and are more affected by inaccuracies in relevant features.

Findings

01

Participants rated relevant feature inaccuracies as more problematic.

02

Overgeneralizations over less relevant features were rated lower than precise matches.

03

Users are less sensitive to imprecise explanations involving irrelevant features.

Abstract

Concept-based explainable artificial intelligence (C-XAI) can let people see which representations an AI model has learned. This is particularly important when high-level semantic information (e.g., actions and relations) is used to make decisions about abstract categories (e.g., danger). In such tasks, AI models need to generalise beyond situation-specific details, and this ability can be reflected in C-XAI outputs that randomise over irrelevant features. However, it is unclear whether people appreciate such generalisation and can distinguish it from other, less desirable forms of imprecision in C-XAI outputs. Therefore, the present study investigated how the generality and relevance of C-XAI outputs affect people's evaluation of AI. In an experimental railway safety evaluation scenario, participants rated the performance of a simulated AI that classified traffic scenes involving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.