Geometric structure guided model and algorithms for complete deconvolution of gene expression data
Duan Chen, Shaoyu Li, Xue Wang

TL;DR
This paper introduces a geometric structure guided NMF model for complete deconvolution of bulk RNAseq data, enhancing interpretability and accuracy by integrating biological marker genes and spectral clustering.
Contribution
It develops a novel NMF-based model that incorporates biological markers and geometric structure to improve solution identifiability in RNAseq deconvolution.
Findings
Significantly improved solution interpretability.
Enhanced accuracy in deconvolution results.
Validated on synthetic and biological data.
Abstract
Complete deconvolution analysis for bulk RNAseq data is important and helpful to distinguish whether the difference of disease-associated GEPs (gene expression profiles) in tissues of patients and normal controls are due to changes in cellular composition of tissue samples, or due to GEPs changes in specific cells. One of the major techniques to perform complete deconvolution is nonnegative matrix factorization (NMF), which also has a wide-range of applications in the machine learning community. However, the NMF is a well-known strongly ill-posed problem, so a direct application of NMF to RNAseq data will suffer severe difficulties in the interpretability of solutions. In this paper we develop an NMF-based mathematical model and corresponding computational algorithms to improve the solution identifiability of deconvoluting bulk RNAseq data. In our approach, we combine the biological…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genomics and Chromatin Dynamics · RNA Research and Splicing
