Near-Optimal Sample Complexity Bounds for Maximum Likelihood Estimation of Multivariate Log-concave Densities
Timothy Carpenter, Ilias Diakonikolas, Anastasios Sidiropoulos,, Alistair Stewart

TL;DR
This paper establishes near-optimal bounds on the number of samples needed for the maximum likelihood estimator to learn multivariate log-concave densities in high dimensions, filling a gap in understanding its efficiency.
Contribution
It provides the first finite-sample upper bound for the MLE in dimensions four and higher, showing near-optimal sample complexity for learning log-concave densities.
Findings
Sample complexity upper bound of ilde{O}_d((1/ε)^{(d+3)/2})
Lower bound of Ω_d((1/ε)^{(d+1)/2})
MLE is nearly optimal in high-dimensional density estimation
Abstract
We study the problem of learning multivariate log-concave densities with respect to a global loss function. We obtain the first upper bound on the sample complexity of the maximum likelihood estimator (MLE) for a log-concave density on , for all . Prior to this work, no finite sample upper bound was known for this estimator in more than dimensions. In more detail, we prove that for any and , given samples drawn from an unknown log-concave density on , the MLE outputs a hypothesis that with high probability is -close to , in squared Hellinger loss. A sample complexity lower bound of was previously known for any learning algorithm that achieves this guarantee. We thus establish that the sample complexity of the log-concave MLE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Statistical Methods and Inference · Domain Adaptation and Few-Shot Learning
