Clustering and Latent Semantic Indexing Aspects of the Nonnegative Matrix Factorization
Andri Mirzal

TL;DR
This paper provides a theoretical foundation for the clustering capabilities of nonnegative matrix factorization (NMF) by linking it to graph clustering and evaluates its latent semantic indexing performance against SVD.
Contribution
It establishes a theoretical equivalence between NMF and graph clustering using KKT conditions and evaluates NMF's LSI ability compared to SVD.
Findings
NMF objective is equivalent to graph clustering objective.
NMF effectively addresses synonymy and polysemy in synthetic datasets.
NMF's LSI performance is comparable or superior to SVD in experiments.
Abstract
This paper provides a theoretical support for clustering aspect of the nonnegative matrix factorization (NMF). By utilizing the Karush-Kuhn-Tucker optimality conditions, we show that NMF objective is equivalent to graph clustering objective, so clustering aspect of the NMF has a solid justification. Different from previous approaches which usually discard the nonnegativity constraints, our approach guarantees the stationary point being used in deriving the equivalence is located on the feasible region in the nonnegative orthant. Additionally, since clustering capability of a matrix decomposition technique can sometimes imply its latent semantic indexing (LSI) aspect, we will also evaluate LSI aspect of the NMF by showing its capability in solving the synonymy and polysemy problems in synthetic datasets. And more extensive evaluation will be conducted by comparing LSI performances of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Graph Theory and Algorithms · Image Retrieval and Classification Techniques
