Resolution and Relevance Trade-offs in Deep Learning
Juyong Song, Matteo Marsili, Junghyo Jo

TL;DR
This paper investigates the balance between resolution and relevance in deep learning representations, showing that efficient neural networks maximize relevance at a given resolution, often exhibiting power law distributions like Zipf's law.
Contribution
It introduces a novel information-theoretic framework linking resolution, relevance, and noise in deep neural networks, supported by extensive numerical experiments.
Findings
Deep networks extract hierarchical efficient representations.
Power law distributions characterize optimal representations.
The most efficient layer balances relevance and resolution, following Zipf's law.
Abstract
Deep learning has been successfully applied to various tasks, but its underlying mechanism remains unclear. Neural networks associate similar inputs in the visible layer to the same state of hidden variables in deep layers. The fraction of inputs that are associated to the same state is a natural measure of similarity and is simply related to the cost in bits required to represent these inputs. The degeneracy of states with the same information cost provides instead a natural measure of noise and is simply related the entropy of the frequency of states, that we call relevance. Representations with minimal noise, at a given level of similarity (resolution), are those that maximise the relevance. A signature of such efficient representations is that frequency distributions follow power laws. We show, in extensive numerical experiments, that deep neural networks extract a hierarchy of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Gaussian Processes and Bayesian Inference · Statistical Mechanics and Entropy
