Nearly Minimax Discrete Distribution Estimation in Kullback-Leibler Divergence with High Probability
Dirk van der Hoeven, Julia Olkhovskaia, Tim van Erven

TL;DR
This paper establishes near-optimal rates for estimating discrete distributions in Kullback-Leibler divergence with high probability, revealing the complexity and limitations of current estimation techniques.
Contribution
It provides tight upper and lower bounds on the minimax estimation rate in KL divergence, introduces a novel estimator via online learning, and develops a new reduction to weak hypothesis testing.
Findings
The minimax rate is between (K + ln(K)ln(1/δ))/n and (K ln ln(K) + ln(K)ln(1/δ))/n.
Standard hypothesis testing reductions are insufficient for tight lower bounds in KL estimation.
The maximum likelihood estimator achieves the total variation rate under certain conditions.
Abstract
We consider the fundamental problem of estimating a discrete distribution on a domain of size with high probability in Kullback-Leibler divergence. We provide upper and lower bounds on the minimax estimation rate, which show that the optimal rate is between and at error probability and sample size , which pins down the rate up to the doubly logarithmic factor that multiplies . Our upper bound uses techniques from online learning to construct a novel estimator via online-to-batch conversion. Perhaps surprisingly, the tail behavior of the minimax rate is worse than for the squared total variation and squared Hellinger distance, for which it is , i.e. without the multiplying . As a consequence, we cannot obtain a fully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed Sensor Networks and Detection Algorithms · Advanced Wireless Communication Techniques · Advanced Data Compression Techniques
