A High-Performance Parallel Algorithm for Nonnegative Matrix Factorization
Ramakrishnan Kannan, Grey Ballard, Haesun Park

TL;DR
This paper introduces the first high-performance parallel algorithm for non-negative matrix factorization that efficiently handles large dense and sparse datasets using distributed-memory computing, MPI, and flexible local solvers.
Contribution
It presents a novel distributed-memory parallel NMF algorithm that minimizes communication costs, supports both dense and sparse matrices, and allows flexible local NLS solvers, outperforming existing methods.
Findings
Significant scalability demonstrated on large datasets
Performance improvements over baseline implementations
Effective handling of both dense and sparse matrices
Abstract
Non-negative matrix factorization (NMF) is the problem of determining two non-negative low rank factors and , for the given input matrix , such that . NMF is a useful tool for many applications in different domains such as topic modeling in text mining, background separation in video analysis, and community detection in social networks. Despite its popularity in the data mining community, there is a lack of efficient parallel software to solve the problem for big datasets. Existing distributed-memory algorithms are limited in terms of performance and applicability, as they are implemented using Hadoop and are designed only for sparse matrices. We propose a distributed-memory parallel algorithm that computes the factorization by iteratively solving alternating non-negative least squares (NLS) subproblems for and . To our knowledge, our algorithm is the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Sparse and Compressive Sensing Techniques · Matrix Theory and Algorithms
