Unifying VXAI: A Systematic Review and Framework for the Evaluation of Explainable AI
David Dembinsky, Adriano Lucieri, Stanislav Frolov, Hiba Najjar, Ko Watanabe, Andreas Dengel

TL;DR
This paper provides a comprehensive review and a unified framework for evaluating explainable AI methods, addressing the lack of standardized metrics and protocols to improve trustworthiness and comparability.
Contribution
It conducts a systematic review of XAI evaluation methods and introduces VXAI, a structured framework for standardized assessment and comparison of XAI explanations.
Findings
Identified 362 relevant publications on XAI evaluation
Aggregated 41 metric groups for explanation assessment
Proposed a three-dimensional categorization scheme for explanations
Abstract
Modern AI systems frequently rely on opaque black-box models, most notably Deep Neural Networks, whose performance stems from complex architectures with millions of learned parameters. While powerful, their complexity poses a major challenge to trustworthiness, particularly due to a lack of transparency. Explainable AI (XAI) addresses this issue by providing human-understandable explanations of model behavior. However, to ensure their usefulness and trustworthiness, such explanations must be rigorously evaluated. Despite the growing number of XAI methods, the field lacks standardized evaluation protocols and consensus on appropriate metrics. To address this gap, we conduct a systematic literature review following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines and introduce a unified framework for the eValuation of XAI (VXAI). We identify 362…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
