Nonnegative Matrix Factorization in the Component-Wise L1 Norm for Sparse Data
Giovanni Seraghiti, K\'evin Dubrulle, Arnaud Vandaele, Nicolas Gillis

TL;DR
This paper introduces a new L1-norm based nonnegative matrix factorization method tailored for sparse, noisy data, providing theoretical insights, a novel model, and an efficient coordinate descent algorithm.
Contribution
The paper proves NP-hardness of L1-NMF, proposes a weighted L1-NMF model for sparse data, and develops a scalable coordinate descent algorithm for large-scale applications.
Findings
L1-NMF enforces sparsity and interpretability in factors.
Weighted L1-NMF effectively handles false zeros in data.
The sCD algorithm scales with nonzero data entries, enabling large-scale processing.
Abstract
Nonnegative matrix factorization (NMF) approximates a nonnegative matrix, , by the product of two nonnegative factors, , where has columns and has rows. In this paper, we consider NMF using the component-wise L1 norm as the error measure (L1-NMF), which is suited for data corrupted by heavy-tailed noise, such as Laplace noise or salt and pepper noise, or in the presence of outliers. Our first contribution is an NP-hardness proof for L1-NMF, even when , in contrast to the standard NMF that uses least squares. Our second contribution is to show that L1-NMF strongly enforces sparsity in the factors for sparse input matrices, thereby favoring interpretability. However, if the data is affected by false zeros, too sparse solutions might degrade the model. Our third contribution is a new, more general, L1-NMF model for sparse data, dubbed weighted L1-NMF (wL1-NMF),…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
