Mapping Knowledge Representations to Concepts: A Review and New   Perspectives

Lars Holmberg; Paul Davidsson; Per Linde

arXiv:2301.00189·cs.AI·January 3, 2023

Mapping Knowledge Representations to Concepts: A Review and New Perspectives

Lars Holmberg, Paul Davidsson, Per Linde

PDF

Open Access

TL;DR

This paper reviews methods for linking neural network internal representations to human-understandable concepts, proposing a taxonomy based on deductive explanations and discussing the goals of model explainability.

Contribution

It introduces a taxonomy for neural network explanations using deductive reasoning and causality, and clarifies the distinction between understanding models and providing actionable explanations.

Findings

01

Taxonomy based on deductive explanations and causality.

02

Ambiguity in the goal of explainability: understanding vs. actionability.

03

Insights into expectations and limitations of neural network explanations.

Abstract

The success of neural networks builds to a large extent on their ability to create internal knowledge representations from real-world high-dimensional data, such as images, sound, or text. Approaches to extract and present these representations, in order to explain the neural network's decisions, is an active and multifaceted research field. To gain a deeper understanding of a central aspect of this field, we have performed a targeted review focusing on research that aims to associate internal representations with human understandable concepts. In doing this, we added a perspective on the existing research by using primarily deductive nomological explanations as a proposed taxonomy. We find this taxonomy and theories of causality, useful for understanding what can be expected, and not expected, from neural network explanations. The analysis additionally uncovers an ambiguity in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Neural Networks and Applications