Learning to Compress: Local Rank and Information Compression in Deep   Neural Networks

Niket Patel; Ravid Shwartz-Ziv

arXiv:2410.07687·cs.LG·October 11, 2024

Learning to Compress: Local Rank and Information Compression in Deep Neural Networks

Niket Patel, Ravid Shwartz-Ziv

PDF

Open Access

TL;DR

This paper explores how deep neural networks tend to learn low-dimensional feature representations by reducing local rank during training, linking this behavior to information compression and the Information Bottleneck theory.

Contribution

It introduces the concept of local rank as a measure of feature manifold dimensionality and demonstrates its decrease during training, connecting rank reduction to mutual information compression.

Findings

01

Local rank decreases during the final training phase.

02

Rank reduction correlates with mutual information compression.

03

Theoretical and empirical evidence supports the link between rank and information bottleneck.

Abstract

Deep neural networks tend to exhibit a bias toward low-rank solutions during training, implicitly learning low-dimensional feature representations. This paper investigates how deep multilayer perceptrons (MLPs) encode these feature manifolds and connects this behavior to the Information Bottleneck (IB) theory. We introduce the concept of local rank as a measure of feature manifold dimensionality and demonstrate, both theoretically and empirically, that this rank decreases during the final phase of training. We argue that networks that reduce the rank of their learned representations also compress mutual information between inputs and intermediate layers. This work bridges the gap between feature manifold rank and information compression, offering new insights into the interplay between information bottlenecks and representation learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications