Examining the causal structures of deep neural networks using information theory
Simon Mattsson, Eric J. Michaud, Erik Hoel

TL;DR
This paper introduces information-theoretic metrics to analyze the causal structure of deep neural networks, providing insights into how layers influence each other and evolve during training, which can improve understanding and generalization.
Contribution
It proposes a novel suite of metrics based on effective information to quantify and visualize the causal structure of DNNs during training, bridging causality and information theory.
Findings
Effective information quantifies causal influence in DNN layers.
Layer sensitivity and degeneracy can be tracked over training.
Causal properties evolve, affecting generalization and explainability.
Abstract
Deep Neural Networks (DNNs) are often examined at the level of their response to input, such as analyzing the mutual information between nodes and data sets. Yet DNNs can also be examined at the level of causation, exploring "what does what" within the layers of the network itself. Historically, analyzing the causal structure of DNNs has received less attention than understanding their responses to input. Yet definitionally, generalizability must be a function of a DNN's causal structure since it reflects how the DNN responds to unseen or even not-yet-defined future inputs. Here, we introduce a suite of metrics based on information theory to quantify and track changes in the causal structure of DNNs during training. Specifically, we introduce the effective information (EI) of a feedforward DNN, which is the mutual information between layer input and output following a maximum-entropy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Adversarial Robustness in Machine Learning · Neural dynamics and brain function
