Improved Spectral-Norm Bounds for Clustering
Pranjal Awasthi, Or Sheffet

TL;DR
This paper enhances spectral-norm bounds for clustering, weakening separation and proximity conditions while maintaining guarantees, thus broadening applicability to various mixture models.
Contribution
It introduces weaker separation and proximity bounds for clustering, enabling effective clustering under less restrictive conditions than prior work.
Findings
Achieves the same guarantees with a factor of √k weaker separation bound.
Provides better guarantees when only a fraction of points satisfy the proximity condition.
Matches and improves upon results in planted partition and Gaussian mixture models.
Abstract
Aiming to unify known results about clustering mixtures of distributions under separation conditions, Kumar and Kannan[2010] introduced a deterministic condition for clustering datasets. They showed that this single deterministic condition encompasses many previously studied clustering assumptions. More specifically, their proximity condition requires that in the target -clustering, the projection of a point onto the line joining its cluster center and some other center , is a large additive factor closer to than to . This additive factor can be roughly described as times the spectral norm of the matrix representing the differences between the given (known) dataset and the means of the (unknown) target clustering. Clearly, the proximity condition implies center separation -- the distance between any two centers must be as large as the above mentioned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Advanced Clustering Algorithms Research · Face and Expression Recognition
