Optimal Clustering by Lloyd Algorithm for Low-Rank Mixture Model
Zhongyuan Lyu, Dong Xia

TL;DR
This paper introduces a low-rank mixture model for matrix clustering, proposing an efficient Lloyd-based algorithm that achieves minimax optimal error rates under certain conditions, with theoretical and empirical validation.
Contribution
It develops a low-rank adaptation of Lloyd's algorithm for matrix clustering, providing theoretical guarantees and demonstrating improved performance over existing methods.
Findings
Algorithm converges quickly with good initialization.
Achieves minimax optimal clustering error rate.
Outperforms existing methods on real datasets.
Abstract
This paper investigates the computational and statistical limits in clustering matrix-valued observations. We propose a low-rank mixture model (LrMM), adapted from the classical Gaussian mixture model (GMM) to treat matrix-valued observations, which assumes low-rankness for population center matrices. A computationally efficient clustering method is designed by integrating Lloyd's algorithm and low-rank approximation. Once well-initialized, the algorithm converges fast and achieves an exponential-type clustering error rate that is minimax optimal. Meanwhile, we show that a tensor-based spectral method delivers a good initial clustering. Comparable to GMM, the minimax optimal clustering error rate is decided by the separation strength, i.e., the minimal distance between population center matrices. By exploiting low-rankness, the proposed algorithm is blessed with a weaker requirement on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Tensor decomposition and applications · Bayesian Methods and Mixture Models
