The Broad Optimality of Profile Maximum Likelihood

Yi Hao; Alon Orlitsky

arXiv:1906.03794·stat.ML·July 12, 2019·1 cites

The Broad Optimality of Profile Maximum Likelihood

Yi Hao, Alon Orlitsky

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that the profile maximum likelihood (PML) estimator is a unified, sample-optimal approach for various fundamental distribution learning tasks, achieving optimal or near-optimal sample complexities across multiple problems.

Contribution

The paper introduces and analyzes the PML estimator as a universal, sample-optimal method for distribution estimation, property estimation, and testing, including novel variants like truncated PML (TPML).

Findings

01

PML achieves optimal sample complexity for sorted-distribution estimation.

02

PML-based estimators outperform traditional methods like Good-Turing.

03

The paper introduces a near-linear-time computable PML variant and a novel truncated PML (TPML).

Abstract

We study three fundamental statistical-learning problems: distribution estimation, property estimation, and property testing. We establish the profile maximum likelihood (PML) estimator as the first unified sample-optimal approach to a wide range of learning tasks. In particular, for every alphabet size $k$ and desired accuracy $ε$ : $Distribution estimation$ Under $ℓ_{1}$ distance, PML yields optimal $Θ (k / (ε^{2} lo g k))$ sample complexity for sorted-distribution estimation, and a PML-based estimator empirically outperforms the Good-Turing estimator on the actual distribution; $Additive property estimation$ For a broad class of additive properties, the PML plug-in estimator uses just four times the sample size required by the best estimator to achieve roughly twice its error, with exponentially higher confidence;…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ucsdyi/PML
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Algorithms and Data Compression