# Uncovering Neural Learning Dynamics Through Latent Mutual Information

**Authors:** Arianna Issitt, Alex Merino, Lamine Deen, Ryan T. White, Mackenzie J. Meni

PMC · DOI: 10.3390/e28010118 · Entropy · 2026-01-19

## TL;DR

This paper explores how neural networks process information during learning by tracking mutual information, revealing that label-relevant information concentrates in specific channels.

## Contribution

The study introduces a new perspective on representation learning through selective concentration and decorrelation of information.

## Key findings

- Label-relevant mutual information increases with network depth across different architectures.
- High-MI channels are functionally important for accuracy, as confirmed by inference-time experiments.
- A dependence-aware regularizer can improve training speed and accuracy by encouraging desired information patterns.

## Abstract

We study how convolutional neural networks reorganize information during learning in natural image classification tasks by tracking mutual information (MI) between inputs, intermediate representations, and labels. Across VGG-16, ResNet-18, and ResNet-50, we find that label-relevant MI grows reliably with depth while input MI depends strongly on architecture and activation, indicating that “compression’’ is not a universal phenomenon. Within convolutional layers, label information becomes increasingly concentrated in a small subset of channels; inference-time knockouts, shuffles, and perturbations confirm that these high-MI channels are functionally necessary for accuracy. This behavior suggests a view of representation learning driven by selective concentration and decorrelation rather than global information reduction. Finally, we show that a simple dependence-aware regularizer based on the Hilbert–Schmidt Independence Criterion can encourage these same patterns during training, yielding small accuracy gains and consistently faster convergence.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12839813/full.md

## Figures

14 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12839813/full.md

## References

21 references — full list in the complete paper: https://tomesphere.com/paper/PMC12839813/full.md

---
Source: https://tomesphere.com/paper/PMC12839813