Statistical Criticality arises in Most Informative Representations
Ryan John Cubero, Junghyo Jo, Matteo Marsili, Yasser Roudi, Juyong, Song

TL;DR
This paper demonstrates that statistical criticality, such as Zipf's law, naturally emerges in samples that are maximally informative about the underlying data-generating process, linking information theory and critical phenomena.
Contribution
It introduces the concept of relevance as a measure of informative frequency distributions and shows that maximally informative samples exhibit criticality and Zipf's law without fine tuning.
Findings
Maximally informative samples maximize relevance at a given resolution.
Zipf's law arises at the optimal trade-off between resolution and relevance.
A bound on the maximum number of estimable parameters is derived.
Abstract
We show that statistical criticality, i.e. the occurrence of power law frequency distributions, arises in samples that are maximally informative about the underlying generating process. In order to reach this conclusion, we first identify the frequency with which different outcomes occur in a sample, as the variable carrying useful information on the generative process. The entropy of the frequency, that we call relevance, provides an upper bound to the number of informative bits. This differs from the entropy of the data, that we take as a measure of resolution. Samples that maximise relevance at a given resolution - that we call maximally informative samples - exhibit statistical criticality. In particular, Zipf's law arises at the optimal trade-off between resolution (i.e. compression) and relevance. As a byproduct, we derive a bound of the maximal number of parameters that can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
