Fast Parallel Randomized Algorithm for Nonnegative Matrix Factorization with KL Divergence for Large Sparse Datasets

Duy Khuong Nguyen; Tu Bao Ho

arXiv:1604.04026·math.OC·April 15, 2016

Open Access

TL;DR

This paper introduces a fast parallel randomized coordinate descent algorithm for Nonnegative Matrix Factorization with KL divergence, effectively handling large sparse datasets and improving convergence and performance over existing methods.

Contribution

The paper presents a novel fast parallel randomized algorithm for NMF-KL that achieves better convergence and scalability on large sparse datasets.

Findings

01 Outperforms existing methods in speed and accuracy

02 Achieves sparse models and representations efficiently

03 Effective for large-scale sparse data analysis

Abstract

Nonnegative Matrix Factorization (NMF) with Kullback-Leibler Divergence (NMF-KL) is one of the most significant NMF problems and is equivalent to Probabilistic Latent Semantic Indexing (PLSI), which has been successfully applied in many applications. For sparse count data, a Poisson distribution and KL divergence provide sparse models and sparse representations, which describe the random variation better than a normal distribution and the Frobenius norm. Specifically, sparse models provide a more concise understanding of the appearance of attributes over latent components, while sparse representations provide concise interpretability of the contribution of latent components over instances. However, minimizing NMF with KL divergence is much more difficult than minimizing NMF with the Frobenius norm; and sparse models, sparse representations and fast algorithms for large sparse datasets are still…
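To make the objective named in the abstract concrete, here is a minimal sketch of the generalized KL divergence D(V || WH) = sum_ij [ v_ij log(v_ij / (WH)_ij) - v_ij + (WH)_ij ] evaluated for a sparse V. It illustrates only the loss, not the paper's coordinate descent algorithm. The function name `kl_divergence_sparse`, the dictionary-of-nonzeros layout, and the plain-list factors are assumptions made for illustration, and the sketch assumes (WH)_ij > 0 wherever v_ij > 0.

```python
import math

def kl_divergence_sparse(entries, W, H):
    """Generalized KL divergence D(V || WH) for a sparse nonnegative V.

    entries: dict mapping (i, j) -> v_ij > 0 for the nonzero entries of V
             (hypothetical layout chosen for this sketch)
    W: n x r list of lists, H: r x m list of lists, both nonnegative
    """
    r = len(H)
    # Sum of all entries of WH without forming WH:
    # sum_ij (WH)_ij = sum_k (column sum of W)_k * (row sum of H)_k
    w_col = [sum(row[k] for row in W) for k in range(r)]
    h_row = [sum(H[k]) for k in range(r)]
    total = sum(w_col[k] * h_row[k] for k in range(r))
    # Terms involving V touch only its nonzeros (0 * log 0 is taken as 0).
    for (i, j), v in entries.items():
        wh = sum(W[i][k] * H[k][j] for k in range(r))
        total += v * math.log(v / wh) - v
    return total

# Usage: V equals WH exactly, so the divergence is zero.
W = [[1.0, 0.0], [0.0, 1.0]]
H = [[2.0, 0.0], [0.0, 3.0]]
V = {(0, 0): 2.0, (1, 1): 3.0}
d_exact = kl_divergence_sparse(V, W, H)
```

The cost of the loop scales with the number of nonzeros in V rather than with n × m, which is one reason the KL objective suits large sparse count data.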

Taxonomy

Topics: Face and Expression Recognition · Text and Document Classification Technologies · Bayesian Methods and Mixture Models

Methods: Interpretability