Low-Rank Reorganization via Proportional Hazards Non-negative Matrix Factorization Unveils Survival Associated Gene Clusters
Zhi Huang, Paul Salama, Wei Shao, Jie Zhang, Kun Huang

TL;DR
This paper introduces a novel method combining non-negative matrix factorization with Cox proportional hazards regression to identify gene clusters associated with survival outcomes, improving interpretability of high-dimensional genomic data.
Contribution
It proposes a joint optimization framework integrating survival constraints into NMF, which enhances the detection of survival-related gene clusters in high-dimensional data.
Findings
Outperforms existing algorithms in synthetic data simulations.
Unveils biologically meaningful cancer gene clusters.
Identifies potential survival biomarkers.
Abstract
One of the central goals in precision health is the understanding and interpretation of high-dimensional biological data to identify genes and markers associated with disease initiation, development, and outcomes. Though significant effort has been committed to harness gene expression data for multiple analyses while accounting for time-to-event modeling by including survival times, many traditional analyses have focused separately on non-negative matrix factorization (NMF) of the gene expression data matrix and survival regression with Cox proportional hazards model. In this work, Cox proportional hazards regression is integrated with NMF by imposing survival constraints. This is accomplished by jointly optimizing the Frobenius norm and partial log likelihood for events such as death or relapse. Simulation results on synthetic data demonstrated the superiority of the proposed method,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Statistical Methods and Inference
