Near-Optimal Sample Complexity Bounds for Maximum Likelihood Estimation   of Multivariate Log-concave Densities

Timothy Carpenter; Ilias Diakonikolas; Anastasios Sidiropoulos,; Alistair Stewart

arXiv:1802.10575·math.ST·December 6, 2018·5 cites

Near-Optimal Sample Complexity Bounds for Maximum Likelihood Estimation of Multivariate Log-concave Densities

Timothy Carpenter, Ilias Diakonikolas, Anastasios Sidiropoulos,, Alistair Stewart

PDF

Open Access

TL;DR

This paper establishes near-optimal bounds on the number of samples needed for the maximum likelihood estimator to learn multivariate log-concave densities in high dimensions, filling a gap in understanding its efficiency.

Contribution

It provides the first finite-sample upper bound for the MLE in dimensions four and higher, showing near-optimal sample complexity for learning log-concave densities.

Findings

01

Sample complexity upper bound of ilde{O}_d((1/ε)^{(d+3)/2})

02

Lower bound of Ω_d((1/ε)^{(d+1)/2})

03

MLE is nearly optimal in high-dimensional density estimation

Abstract

We study the problem of learning multivariate log-concave densities with respect to a global loss function. We obtain the first upper bound on the sample complexity of the maximum likelihood estimator (MLE) for a log-concave density on $R^{d}$ , for all $d \geq 4$ . Prior to this work, no finite sample upper bound was known for this estimator in more than $3$ dimensions. In more detail, we prove that for any $d \geq 1$ and $ϵ > 0$ , given $\tilde{O}_{d} ((1/ ϵ)^{(d + 3) /2})$ samples drawn from an unknown log-concave density $f_{0}$ on $R^{d}$ , the MLE outputs a hypothesis $h$ that with high probability is $ϵ$ -close to $f_{0}$ , in squared Hellinger loss. A sample complexity lower bound of $Ω_{d} ((1/ ϵ)^{(d + 1) /2})$ was previously known for any learning algorithm that achieves this guarantee. We thus establish that the sample complexity of the log-concave MLE…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Statistical Methods and Inference · Domain Adaptation and Few-Shot Learning