XAI Handbook: Towards a Unified Framework for Explainable AI
Sebastian Palacio, Adriano Lucieri, Mohsin Munir, J\"orn Hees, Sheraz, Ahmed, Andreas Dengel

TL;DR
This paper introduces a unified theoretical framework for explainable AI that standardizes terminology, guides explanation generation, and enables comparison of different methods like LIME, SHAP, and MDNet.
Contribution
It provides concrete definitions for key XAI terms and a comprehensive framework to evaluate and compare existing explanation methods.
Findings
Framework aligns with explanation and interpretability desiderata
Enables comparison of LIME, SHAP, and MDNet
Highlights advantages and shortcomings of methods
Abstract
The field of explainable AI (XAI) has quickly become a thriving and prolific community. However, a silent, recurrent and acknowledged issue in this area is the lack of consensus regarding its terminology. In particular, each new contribution seems to rely on its own (and often intuitive) version of terms like "explanation" and "interpretation". Such disarray encumbers the consolidation of advances in the field towards the fulfillment of scientific and regulatory demands e.g., when comparing methods or establishing their compliance with respect to biases and fairness constraints. We propose a theoretical framework that not only provides concrete definitions for these terms, but it also outlines all steps necessary to produce explanations and interpretations. The framework also allows for existing contributions to be re-contextualized such that their scope can be measured, thus making…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsShapley Additive Explanations · Local Interpretable Model-Agnostic Explanations
