Better Agnostic Clustering Via Relaxed Tensor Norms
Pravesh K. Kothari, Jacob Steinhardt

TL;DR
This paper introduces a new convex relaxation method for $k$-means clustering using sum-of-squares norms, enabling efficient learning of mixture models with certain distributional properties and outlier robustness.
Contribution
It develops a novel convex relaxation based on sum-of-squares norms and proves bounds for Poincare distributions, advancing clustering and mixture learning in robust and high-dimensional settings.
Findings
Efficient algorithm for learning mixture means with separation $ ext{Omega}(k^{ ext{gamma}})$.
Partial resolution of the open problem on Gaussian mixture separation.
Robust clustering algorithm effective even with arbitrary outliers.
Abstract
We develop a new family of convex relaxations for -means clustering based on sum-of-squares norms, a relaxation of the injective tensor norm that is efficiently computable using the Sum-of-Squares algorithm. We give an algorithm based on this relaxation that recovers a faithful approximation to the true means in the given data whenever the low-degree moments of the points in each cluster have bounded sum-of-squares norms. We then prove a sharp upper bound on the sum-of-squares norms for moment tensors of any distribution that satisfies the \emph{Poincare inequality}. The Poincare inequality is a central inequality in probability theory, and a large class of distributions satisfy it including Gaussians, product distributions, strongly log-concave distributions, and any sum or uniformly continuous transformation of such distributions. As an immediate corollary, for any $\gamma >…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Bayesian Methods and Mixture Models · Machine Learning and Algorithms
