Learning Mixture Models via Efficient High-dimensional Sparse Fourier Transforms
Alkis Kalavasis, Pravesh K. Kothari, Shuchen Li, Manolis Zampetakis

TL;DR
This paper introduces a polynomial-time algorithm for learning mixture models of heavy-tailed distributions in high dimensions using sparse Fourier transforms, bypassing traditional moment-based limitations.
Contribution
The authors develop a novel high-dimensional sparse Fourier transform technique for mixture model learning that works with heavy-tailed distributions and requires no minimum separation.
Findings
Algorithm successfully learns heavy-tailed mixture components in polynomial time.
Method applies to distributions without finite covariances, like Laplace.
No minimum separation needed between cluster means.
Abstract
In this work, we give a time and sample algorithm for efficiently learning the parameters of a mixture of spherical distributions in dimensions. Unlike all previous methods, our techniques apply to heavy-tailed distributions and include examples that do not even have finite covariances. Our method succeeds whenever the cluster distributions have a characteristic function with sufficiently heavy tails. Such distributions include the Laplace distribution but crucially exclude Gaussians. All previous methods for learning mixture models relied implicitly or explicitly on the low-degree moments. Even for the case of Laplace distributions, we prove that any such algorithm must use super-polynomially many samples. Our method thus adds to the short list of techniques that bypass the limitations of the method of moments. Somewhat surprisingly, our algorithm does not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Bayesian Methods and Mixture Models · Gaussian Processes and Bayesian Inference
