Statistical Inference, Occam's Razor and Statistical Mechanics on The Space of Probability Distributions
Vijay Balasubramanian (Princeton University)

TL;DR
This paper models parametric model selection using statistical mechanics, providing a systematic Bayesian framework that naturally incorporates Occam's Razor and introduces a complexity measure called the 'razor' of a model.
Contribution
It extends existing results by applying low-temperature expansions to Bayesian model selection and offers a new interpretation of Jeffreys' prior as a uniform distribution over models.
Findings
Derives a systematic series for Bayesian posterior probability.
Provides a physical interpretation of Occam's Razor within probability theory.
Introduces the 'razor' as a measure of model complexity.
Abstract
The task of parametric model selection is cast in terms of a statistical mechanics on the space of probability distributions. Using the techniques of low-temperature expansions, we arrive at a systematic series for the Bayesian posterior probability of a model family that significantly extends known results in the literature. In particular, we arrive at a precise understanding of how Occam's Razor, the principle that simpler models should be preferred until the data justifies more complex models, is automatically embodied by probability theory. These results require a measure on the space of model parameters and we derive and discuss an interpretation of Jeffreys' prior distribution as a uniform prior over the distributions indexed by a family. Finally, we derive a theoretical index of the complexity of a parametric family relative to some true distribution that we call the {\it razor}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Neural Networks and Applications
