On the Competitive Analysis and High Accuracy Optimality of Profile Maximum Likelihood
Yanjun Han, Kirankumar Shiragur

TL;DR
This paper improves the theoretical understanding of profile maximum likelihood (PML) estimators, demonstrating their optimality in estimating symmetric properties of discrete distributions with high accuracy and establishing new bounds on error probabilities.
Contribution
It strengthens previous results by providing tighter error bounds for PML, proving its optimality in estimating sorted distributions, and introduces novel techniques for analyzing PML properties.
Findings
PML achieves near-optimal sample complexity for estimating sorted distributions.
Error probability bounds for PML are significantly improved.
A new estimator with better concentration properties is proposed.
Abstract
A striking result of [Acharya et al. 2017] showed that to estimate symmetric properties of discrete distributions, plugging in the distribution that maximizes the likelihood of observed multiset of frequencies, also known as the profile maximum likelihood (PML) distribution, is competitive compared with any estimators regardless of the symmetric property. Specifically, given observations from the discrete distribution, if some estimator incurs an error with probability at most , then plugging in the PML distribution incurs an error with probability at most . In this paper, we strengthen the above result and show that using a careful chaining argument, the error probability can be reduced to for arbitrarily small constants and some constant . In particular, we show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Statistical Methods and Bayesian Inference
