Examining the causal structures of deep neural networks using   information theory

Simon Mattsson; Eric J. Michaud; Erik Hoel

arXiv:2010.13871·cs.LG·October 28, 2020

Examining the causal structures of deep neural networks using information theory

Simon Mattsson, Eric J. Michaud, Erik Hoel

PDF

Open Access 1 Repo

TL;DR

This paper introduces information-theoretic metrics to analyze the causal structure of deep neural networks, providing insights into how layers influence each other and evolve during training, which can improve understanding and generalization.

Contribution

It proposes a novel suite of metrics based on effective information to quantify and visualize the causal structure of DNNs during training, bridging causality and information theory.

Findings

01

Effective information quantifies causal influence in DNN layers.

02

Layer sensitivity and degeneracy can be tracked over training.

03

Causal properties evolve, affecting generalization and explainability.

Abstract

Deep Neural Networks (DNNs) are often examined at the level of their response to input, such as analyzing the mutual information between nodes and data sets. Yet DNNs can also be examined at the level of causation, exploring "what does what" within the layers of the network itself. Historically, analyzing the causal structure of DNNs has received less attention than understanding their responses to input. Yet definitionally, generalizability must be a function of a DNN's causal structure since it reflects how the DNN responds to unseen or even not-yet-defined future inputs. Here, we introduce a suite of metrics based on information theory to quantify and track changes in the causal structure of DNNs during training. Specifically, we introduce the effective information (EI) of a feedforward DNN, which is the mutual information between layer input and output following a maximum-entropy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ei-research-group/deep-ei
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Adversarial Robustness in Machine Learning · Neural dynamics and brain function