Measuring Information Transfer in Neural Networks
Xiao Zhang, Xingjian Li, Dejing Dou, Ji Wu

TL;DR
This paper introduces a practical measure called Information Transfer ($L_{IT}$) to quantify the generalizable information in neural networks, aiding analysis of models, datasets, and learning processes.
Contribution
It proposes a new information measure based on prequential coding that estimates the generalizable content of neural networks, with applications in model analysis and transfer learning.
Findings
$L_{IT}$ correlates with generalizable information in models.
$L_{IT}$ helps analyze dataset information and model knowledge.
$L_{IT}$ provides insights into catastrophic forgetting and continual learning.
Abstract
Quantifying the information content in a neural network model is essentially estimating the model's Kolmogorov complexity. Recent success of prequential coding on neural networks points to a promising path of deriving an efficient description length of a model. We propose a practical measure of the generalizable information in a neural network model based on prequential coding, which we term Information Transfer (). Theoretically, is an estimation of the generalizable part of a model's information content. In experiments, we show that is consistently correlated with generalizable information and can be used as a measure of patterns or "knowledge" in a model or a dataset. Consequently, can serve as a useful analysis tool in deep learning. In this paper, we apply to compare and dissect information in datasets, evaluate representation models in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and ELM · Adversarial Robustness in Machine Learning
