Minimax Estimation of Discrete Distributions under $\ell_1$ Loss
Yanjun Han, Jiantao Jiao, Tsachy Weissman

TL;DR
This paper investigates the fundamental limits of estimating discrete distributions under $\, ext{ extonehalf}$ loss, providing bounds and proposing estimators that are asymptotically minimax, especially when the alphabet size grows with data.
Contribution
It derives non-asymptotic bounds for empirical and minimax risks, and introduces a hard-thresholding estimator that achieves asymptotic minimaxity without knowing the entropy bound.
Findings
Empirical distribution risk asymptotically $2H/\, ext{ln}\,n$
Minimax risk asymptotically $H/\, ext{ln}\,n$ for bounded entropy
A simple hard-thresholding estimator is asymptotically minimax
Abstract
We analyze the problem of discrete distribution estimation under loss. We provide non-asymptotic upper and lower bounds on the maximum risk of the empirical distribution (the maximum likelihood estimator), and the minimax risk in regimes where the alphabet size may grow with the number of observations . We show that among distributions with bounded entropy , the asymptotic maximum risk for the empirical distribution is , while the asymptotic minimax risk is . Moreover, Moreover, we show that a hard-thresholding estimator oblivious to the unknown upper bound , is asymptotically minimax. However, if we constrain the estimates to lie in the simplex of probability distributions, then the asymptotic minimax risk is again . We draw connections between our work and the literature on density estimation, entropy estimation, total variation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Statistical Methods and Inference · Bayesian Modeling and Causal Inference
