A Human-Centric Assessment Framework for AI
Sascha Saralajew, Ammar Shaker, Zhao Xu, Kiril Gashteovski, and Bhushan Kotnis, Wiem Ben Rim, J\"urgen Quittek, Carolin Lawrence

TL;DR
This paper proposes a human-centric assessment framework for AI, inspired by the Turing test, to evaluate AI systems' performance and explainability through expert acceptance rates.
Contribution
It introduces a novel, human-centric evaluation framework for AI explainability, including two instantiations for assessing accuracy and explanation usefulness.
Findings
Framework effectively compares AI and human expert acceptance rates.
Assessment can evaluate both classification accuracy and explanation usefulness.
Demonstrated applicability through two specific instantiations.
Abstract
With the rise of AI systems in real-world applications comes the need for reliable and trustworthy AI. An essential aspect of this are explainable AI systems. However, there is no agreed standard on how explainable AI systems should be assessed. Inspired by the Turing test, we introduce a human-centric assessment framework where a leading domain expert accepts or rejects the solutions of an AI system and another domain expert. By comparing the acceptance rates of provided solutions, we can assess how the AI system performs compared to the domain expert, and whether the AI system's explanations (if provided) are human-understandable. This setup -- comparable to the Turing test -- can serve as a framework for a wide range of human-centric AI system assessments. We demonstrate this by presenting two instantiations: (1) an assessment that measures the classification accuracy of a system…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI
