TL;DR
This paper demonstrates that optimal MDL codes are critical in a precise sense, generating broad distributions and sitting at a phase transition point, which explains the prevalence of statistical criticality.
Contribution
It reveals that MDL codes are critical and sit at a phase transition, providing a new understanding of their optimality and the nature of statistical criticality.
Findings
MDL codes generate samples with broad empirical distributions.
MDL codes are at a second order phase transition point.
Optimal compression codes do not exist beyond this critical point.
Abstract
In the Minimum Description Length (MDL) principle, learning from the data is equivalent to an optimal coding problem. We show that the codes that achieve optimal compression in MDL are critical in a very precise sense. First, when they are taken as generative models of samples, they generate samples with broad empirical distributions and with a high value of the relevance, defined as the entropy of the empirical frequencies. These results are derived for different statistical models (Dirichlet model, independent and pairwise dependent spin models, and restricted Boltzmann machines). Second, MDL codes sit precisely at a second order phase transition point where the symmetry between the sampled outcomes is spontaneously broken. The order parameter controlling the phase transition is the coding cost of the samples. The phase transition is a manifestation of the optimality of MDL codes, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
