Learning probabilities from random observables in high dimensions: the maximum entropy distribution and others
Tomoyuki Obuchi, Simona Cocco, and R\'emi Monasson

TL;DR
This paper investigates how to learn a target probability distribution over binary variables from random observable expectations, analyzing the structure of the compatible distribution space and the effectiveness of maximum entropy methods.
Contribution
It introduces a biased measure over the version space, interpolating between unbiased and maximum entropy distributions, and uses the replica method to analyze phase transitions and learning efficiency.
Findings
Phase transitions improve target distribution learning at critical data amounts.
Maximum entropy distribution is not necessarily closer to the target than other compatible distributions.
Monte Carlo simulations confirm theoretical predictions for small systems.
Abstract
We consider the problem of learning a target probability distribution over a set of binary variables from the knowledge of the expectation values (with this target distribution) of observables, drawn uniformly at random. The space of all probability distributions compatible with these expectation values within some fixed accuracy, called version space, is studied. We introduce a biased measure over the version space, which gives a boost increasing exponentially with the entropy of the distributions and with an arbitrary inverse `temperature' . The choice of allows us to interpolate smoothly between the unbiased measure over all distributions in the version space () and the pointwise measure concentrated at the maximum entropy distribution (). Using the replica method we compute the volume of the version space and other quantities…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Statistical Mechanics and Entropy · Model Reduction and Neural Networks
