From Neural Activations to Concepts: A Survey on Explaining Concepts in   Neural Networks

Jae Hee Lee; Sergio Lanza; Stefan Wermter

arXiv:2310.11884·cs.AI·May 6, 2024·1 cites

From Neural Activations to Concepts: A Survey on Explaining Concepts in Neural Networks

Jae Hee Lee, Sergio Lanza, Stefan Wermter

PDF

Open Access

TL;DR

This survey reviews recent methods for explaining and integrating concepts within neural networks, highlighting their role in advancing neuro-symbolic AI by linking learning and reasoning.

Contribution

It provides a comprehensive overview of approaches for explaining and inserting concepts in neural networks, emphasizing their importance for neuro-symbolic AI development.

Findings

01

Concept explanations facilitate reasoning integration.

02

Knowledge extraction and insertion enhance neural network interpretability.

03

Survey highlights key methods and future directions in concept-based explainability.

Abstract

In this paper, we review recent approaches for explaining concepts in neural networks. Concepts can act as a natural link between learning and reasoning: once the concepts are identified that a neural learning system uses, one can integrate those concepts with a reasoning system for inference or use a reasoning system to act upon them to improve or enhance the learning system. On the other hand, knowledge can not only be extracted from neural networks but concept knowledge can also be inserted into neural network architectures. Since integrating learning and reasoning is at the core of neuro-symbolic AI, the insights gained from this survey can serve as an important step towards realizing neuro-symbolic AI based on explainable concepts.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Explainable Artificial Intelligence (XAI)