Maximum Entropy competes with Maximum Likelihood
A.E. Allahverdyan, N.H. Martirosyan

TL;DR
This paper analyzes the maximum entropy method within a Bayesian decision framework, establishing its validity limits, comparing it with other estimators, and identifying conditions under which it outperforms regularized maximum likelihood.
Contribution
It introduces a Bayesian decision theory approach to evaluate the applicability of MAXENT, clarifies its limitations, and compares it with ML and Bayesian estimators under different prior conditions.
Findings
MAXENT is effective in sparse data regimes with specific prior information.
MAXENT can outperform regularized ML when prior rank correlations exist.
The study provides a systematic way to assess MAXENT's relevance and limitations.
Abstract
Maximum entropy (MAXENT) method has a large number of applications in theoretical and applied machine learning, since it provides a convenient non-parametric tool for estimating unknown probabilities. The method is a major contribution of statistical physics to probabilistic inference. However, a systematic approach towards its validity limits is currently missing. Here we study MAXENT in a Bayesian decision theory set-up, i.e. assuming that there exists a well-defined prior Dirichlet density for unknown probabilities, and that the average Kullback-Leibler (KL) distance can be employed for deciding on the quality and applicability of various estimators. These allow to evaluate the relevance of various MAXENT constraints, check its general applicability, and compare MAXENT with estimators having various degrees of dependence on the prior, viz. the regularized maximum likelihood (ML) and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Gaussian Processes and Bayesian Inference · Advanced Thermodynamics and Statistical Mechanics
