Abstraction, Validation, and Generalization for Explainable Artificial   Intelligence

Scott Cheng-Hsin Yang; Tomas Folke; and Patrick Shafto

arXiv:2105.07508·cs.AI·October 14, 2021

Abstraction, Validation, and Generalization for Explainable Artificial Intelligence

Scott Cheng-Hsin Yang, Tomas Folke, and Patrick Shafto

PDF

Open Access

TL;DR

This paper introduces Bayesian Teaching as a unifying theoretical framework for explainable AI, enabling systematic decomposition, validation, and generalization of XAI methods to improve understanding and development efficiency.

Contribution

It formalizes explanation as a communication act, decomposes XAI into modular components, and promotes validation and recombination for systematic advancement.

Findings

01

Decomposition of XAI methods into four components.

02

Modular validation of each component independently.

03

Recombination of components enables rapid development of new XAI variants.

Abstract

Neural network architectures are achieving superhuman performance on an expanding range of tasks. To effectively and safely deploy these systems, their decision-making must be understandable to a wide range of stakeholders. Methods to explain AI have been proposed to answer this challenge, but a lack of theory impedes the development of systematic abstractions which are necessary for cumulative knowledge gains. We propose Bayesian Teaching as a framework for unifying explainable AI (XAI) by integrating machine learning and human learning. Bayesian Teaching formalizes explanation as a communication act of an explainer to shift the beliefs of an explainee. This formalization decomposes any XAI method into four components: (1) the inference to be explained, (2) the explanatory medium, (3) the explainee model, and (4) the explainer model. The abstraction afforded by Bayesian Teaching to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Adversarial Robustness in Machine Learning