The Broad Optimality of Profile Maximum Likelihood
Yi Hao, Alon Orlitsky

TL;DR
This paper demonstrates that the profile maximum likelihood (PML) estimator is a unified, sample-optimal approach for various fundamental distribution learning tasks, achieving optimal or near-optimal sample complexities across multiple problems.
Contribution
The paper introduces and analyzes the PML estimator as a universal, sample-optimal method for distribution estimation, property estimation, and testing, including novel variants like truncated PML (TPML).
Findings
PML achieves optimal sample complexity for sorted-distribution estimation.
PML-based estimators outperform traditional methods like Good-Turing.
The paper introduces a near-linear-time computable PML variant and a novel truncated PML (TPML).
Abstract
We study three fundamental statistical-learning problems: distribution estimation, property estimation, and property testing. We establish the profile maximum likelihood (PML) estimator as the first unified sample-optimal approach to a wide range of learning tasks. In particular, for every alphabet size and desired accuracy : Under distance, PML yields optimal sample complexity for sorted-distribution estimation, and a PML-based estimator empirically outperforms the Good-Turing estimator on the actual distribution; For a broad class of additive properties, the PML plug-in estimator uses just four times the sample size required by the best estimator to achieve roughly twice its error, with exponentially higher confidence;…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Algorithms and Data Compression
