Clustering by Low-Rank Doubly Stochastic Matrix Decomposition
Zhirong Yang (Aalto University), Erkki Oja (Aalto University)

TL;DR
This paper introduces a novel low-rank clustering method based on bipartite random walks and probabilistic modeling, outperforming traditional matrix factorization approaches especially on large-scale data.
Contribution
It proposes a new low-rank learning approach that goes beyond matrix factorization, using a probabilistic model and a specialized optimization algorithm for improved clustering.
Findings
Strong performance in clustering purity across various datasets
Particularly effective on large-scale manifold data
Optimization via relaxed Majorization-Minimization algorithm
Abstract
Clustering analysis by nonnegative low-rank approximations has achieved remarkable progress in the past decade. However, most approximation approaches in this direction are still restricted to matrix factorization. We propose a new low-rank learning method to improve the clustering performance, which is beyond matrix factorization. The approximation is based on a two-step bipartite random walk through virtual cluster nodes, where the approximation is formed by only cluster assigning probabilities. Minimizing the approximation error measured by Kullback-Leibler divergence is equivalent to maximizing the likelihood of a discriminative model, which endows our method with a solid probabilistic interpretation. The optimization is implemented by a relaxed Majorization-Minimization algorithm that is advantageous in finding good local minima. Furthermore, we point out that the regularized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Face and Expression Recognition · Tensor decomposition and applications
