Abstraction, Validation, and Generalization for Explainable Artificial Intelligence
Scott Cheng-Hsin Yang, Tomas Folke, and Patrick Shafto

TL;DR
This paper introduces Bayesian Teaching as a unifying theoretical framework for explainable AI, enabling systematic decomposition, validation, and generalization of XAI methods to improve understanding and development efficiency.
Contribution
It formalizes explanation as a communication act, decomposes XAI into modular components, and promotes validation and recombination for systematic advancement.
Findings
Decomposition of XAI methods into four components.
Modular validation of each component independently.
Recombination of components enables rapid development of new XAI variants.
Abstract
Neural network architectures are achieving superhuman performance on an expanding range of tasks. To effectively and safely deploy these systems, their decision-making must be understandable to a wide range of stakeholders. Methods to explain AI have been proposed to answer this challenge, but a lack of theory impedes the development of systematic abstractions which are necessary for cumulative knowledge gains. We propose Bayesian Teaching as a framework for unifying explainable AI (XAI) by integrating machine learning and human learning. Bayesian Teaching formalizes explanation as a communication act of an explainer to shift the beliefs of an explainee. This formalization decomposes any XAI method into four components: (1) the inference to be explained, (2) the explanatory medium, (3) the explainee model, and (4) the explainer model. The abstraction afforded by Bayesian Teaching to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Adversarial Robustness in Machine Learning
