Detecting Information Relays in Deep Neural Networks
Arend Hintze (Dalarna University), Christoph Adami (Michigan State, University)

TL;DR
This paper introduces relay information, an information-theoretic measure, to identify functional modules in deep neural networks, enhancing interpretability and understanding of their internal processes.
Contribution
It proposes a novel relay information metric and a greedy search method to detect and analyze functional modules in artificial neural networks.
Findings
Relay information correlates with module functionality.
The method successfully identifies computational modules.
Enhanced interpretability of neural network processes.
Abstract
Deep learning of artificial neural networks (ANNs) is creating highly functional processes that are, unfortunately, nearly as hard to interpret as their biological counterparts. Identification of functional modules in natural brains plays an important role in cognitive and neuroscience alike, and can be carried out using a wide range of technologies such as fMRI, EEG/ERP, MEG, or calcium imaging. However, we do not have such robust methods at our disposal when it comes to understanding functional modules in artificial neural networks. Ideally, understanding which parts of an artificial neural network perform what function might help us to address a number of vexing problems in ANN research, such as catastrophic forgetting and overfitting. Furthermore, revealing a network's modularity could improve our trust in them by making these black boxes more transparent. Here, we introduce a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Neural Networks and Applications · Adversarial Robustness in Machine Learning
