Minimax Estimation of Discrete Distributions under $\ell_1$ Loss

Yanjun Han; Jiantao Jiao; Tsachy Weissman

arXiv:1411.1467·cs.IT·December 31, 2015·5 cites

Minimax Estimation of Discrete Distributions under $\ell_1$ Loss

Yanjun Han, Jiantao Jiao, Tsachy Weissman

PDF

Open Access

TL;DR

This paper investigates the fundamental limits of estimating discrete distributions under $\, ext{ extonehalf}$ loss, providing bounds and proposing estimators that are asymptotically minimax, especially when the alphabet size grows with data.

Contribution

It derives non-asymptotic bounds for empirical and minimax risks, and introduces a hard-thresholding estimator that achieves asymptotic minimaxity without knowing the entropy bound.

Findings

01

Empirical distribution risk asymptotically $2H/\, ext{ln}\,n$

02

Minimax risk asymptotically $H/\, ext{ln}\,n$ for bounded entropy

03

A simple hard-thresholding estimator is asymptotically minimax

Abstract

We analyze the problem of discrete distribution estimation under $ℓ_{1}$ loss. We provide non-asymptotic upper and lower bounds on the maximum risk of the empirical distribution (the maximum likelihood estimator), and the minimax risk in regimes where the alphabet size $S$ may grow with the number of observations $n$ . We show that among distributions with bounded entropy $H$ , the asymptotic maximum risk for the empirical distribution is $2 H / ln n$ , while the asymptotic minimax risk is $H / ln n$ . Moreover, Moreover, we show that a hard-thresholding estimator oblivious to the unknown upper bound $H$ , is asymptotically minimax. However, if we constrain the estimates to lie in the simplex of probability distributions, then the asymptotic minimax risk is again $2 H / ln n$ . We draw connections between our work and the literature on density estimation, entropy estimation, total variation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Statistical Methods and Inference · Bayesian Modeling and Causal Inference