Predictability and Comprehensibility in Post-Hoc XAI Methods: A User-Centered Analysis
Anahid Jalali, Bernhard Haslhofer, Simone Kriglstein, Andreas Rauber

TL;DR
This study evaluates how well users understand and predict black-box model behavior using post-hoc explanations, revealing factors that influence comprehensibility and suggesting design improvements for explainability tools.
Contribution
It provides an empirical user-centered analysis of LIME and SHAP, highlighting how explanations, counterfactuals, and misclassifications affect user understanding.
Findings
SHAP explanations are less comprehensible near decision boundaries
Counterfactual explanations improve user understanding
Misclassifications can enhance predictability
Abstract
Post-hoc explainability methods aim to clarify predictions of black-box machine learning models. However, it is still largely unclear how well users comprehend the provided explanations and whether these increase the users ability to predict the model behavior. We approach this question by conducting a user study to evaluate comprehensibility and predictability in two widely used tools: LIME and SHAP. Moreover, we investigate the effect of counterfactual explanations and misclassifications on users ability to understand and predict the model behavior. We find that the comprehensibility of SHAP is significantly reduced when explanations are provided for samples near a model's decision boundary. Furthermore, we find that counterfactual explanations and misclassifications can significantly increase the users understanding of how a machine learning model is making decisions. Based on our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Data Stream Mining Techniques · Machine Learning and Data Classification
MethodsShapley Additive Explanations · Local Interpretable Model-Agnostic Explanations
