Measuring Information Transfer in Neural Networks

Xiao Zhang; Xingjian Li; Dejing Dou; Ji Wu

arXiv:2009.07624·cs.LG·December 4, 2020

Measuring Information Transfer in Neural Networks

Xiao Zhang, Xingjian Li, Dejing Dou, Ji Wu

PDF

Open Access

TL;DR

This paper introduces a practical measure called Information Transfer ($L_{IT}$) to quantify the generalizable information in neural networks, aiding analysis of models, datasets, and learning processes.

Contribution

It proposes a new information measure based on prequential coding that estimates the generalizable content of neural networks, with applications in model analysis and transfer learning.

Findings

01

$L_{IT}$ correlates with generalizable information in models.

02

$L_{IT}$ helps analyze dataset information and model knowledge.

03

$L_{IT}$ provides insights into catastrophic forgetting and continual learning.

Abstract

Quantifying the information content in a neural network model is essentially estimating the model's Kolmogorov complexity. Recent success of prequential coding on neural networks points to a promising path of deriving an efficient description length of a model. We propose a practical measure of the generalizable information in a neural network model based on prequential coding, which we term Information Transfer ( $L_{I T}$ ). Theoretically, $L_{I T}$ is an estimation of the generalizable part of a model's information content. In experiments, we show that $L_{I T}$ is consistently correlated with generalizable information and can be used as a measure of patterns or "knowledge" in a model or a dataset. Consequently, $L_{I T}$ can serve as a useful analysis tool in deep learning. In this paper, we apply $L_{I T}$ to compare and dissect information in datasets, evaluate representation models in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and ELM · Adversarial Robustness in Machine Learning