Structured Sparse Non-negative Matrix Factorization with L20-Norm for scRNA-seq Data Analysis
Wenwen Min, Taosheng Xu, Xiang Wan, Tsung-Hui Chang

TL;DR
This paper introduces a novel sparse NMF model with an $ ext{l}_{2,0}$-norm constraint for improved feature selection and clustering in high-dimensional scRNA-seq data, supported by efficient algorithms and theoretical guarantees.
Contribution
It proposes the first $ ext{l}_{2,0}$-norm constrained NMF model with provable properties and develops algorithms for sparse, orthogonal NMF tailored for biological data analysis.
Findings
The proposed methods outperform existing approaches in numerical experiments.
The algorithms effectively handle high-dimensional scRNA-seq data.
The $ ext{l}_{2,0}$-norm constraint enhances feature selection and clustering accuracy.
Abstract
Non-negative matrix factorization (NMF) is a powerful tool for dimensionality reduction and clustering. Unfortunately, the interpretation of the clustering results from NMF is difficult, especially for the high-dimensional biological data without effective feature selection. In this paper, we first introduce a row-sparse NMF with -norm constraint (NMF_), where the basis matrix is constrained by the -norm, such that has a row-sparsity pattern with feature selection. It is a challenge to solve the model, because the -norm is non-convex and non-smooth. Fortunately, we prove that the -norm satisfies the Kurdyka-\L{ojasiewicz} property. Based on the finding, we present a proximal alternating linearized minimization algorithm and its monotone accelerated version to solve the NMF_ model. In addition, we also present…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Gene expression and cancer classification · Blind Source Separation Techniques
