TL;DR
This paper introduces a memory-efficient convex optimization algorithm for self-dictionary separable nonnegative matrix factorization, leveraging the Frank-Wolfe method to handle large-scale data with linear memory growth.
Contribution
It proposes the first linear memory complexity algorithm for convex SD-MMV NMF using the Frank-Wolfe approach, improving scalability and robustness in noisy scenarios.
Findings
The algorithm achieves linear memory growth with data size.
It effectively handles noisy data with a regularizer.
Demonstrated success in text mining and community detection tasks.
Abstract
Nonnegative matrix factorization (NMF) often relies on the separability condition for tractable algorithm design. Separability-based NMF is mainly handled by two types of approaches, namely, greedy pursuit and convex programming. A notable convex NMF formulation is the so-called self-dictionary multiple measurement vectors (SD-MMV), which can work without knowing the matrix rank a priori, and is arguably more resilient to error propagation relative to greedy pursuit. However, convex SD-MMV renders a large memory cost that scales quadratically with the problem size. This memory challenge has been around for a decade, and a major obstacle for applying convex SD-MMV to big data analytics. This work proposes a memory-efficient algorithm for convex SD-MMV. Our algorithm capitalizes on the special update rules of a classic algorithm from the 1950s, namely, the Frank-Wolfe (FW) algorithm. It…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
