Tight Bounds on the Hardness of Learning Simple Nonparametric Mixtures
Bryon Aragam, Wai Ming Tai

TL;DR
This paper establishes tight bounds on the sample complexity for learning component distributions in nonparametric mixture models with disjoint supports, revealing an intermediate complexity between polynomial and exponential.
Contribution
It introduces tight bounds on the sample complexity for learning mixture components under specific convolution assumptions, and provides a novel algorithm matching these bounds.
Findings
Sample complexity is between polynomial and exponential in 1/ε.
Proposed algorithm achieves near-optimal sample complexity.
Analysis uses a new approach involving orthogonal functions and a Tauberian theorem.
Abstract
We study the problem of learning nonparametric distributions in a finite mixture, and establish tight bounds on the sample complexity for learning the component distributions in such models. Namely, we are given i.i.d. samples from a pdf where and we are interested in learning each component . Without any assumptions on , this problem is ill-posed. In order to identify the components , we assume that each can be written as a convolution of a Gaussian and a compactly supported density with . Our main result shows that samples are required for estimating each . The proof relies on a quantitative Tauberian theorem that yields a fast rate of approximation with Gaussians, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Bayesian Methods and Mixture Models · Algorithms and Data Compression
MethodsConvolution
