Dimension reduction for integrative survival analysis
Aaron J. Molstad, Rohit K. Patra

TL;DR
This paper introduces a new dimension reduction method for integrative survival analysis across multiple populations, improving efficiency and prediction accuracy by identifying key linear combinations of predictors.
Contribution
It proposes a constrained maximum partial likelihood estimator that leverages distance-to-set penalties for efficient dimension reduction in high-dimensional, multi-population survival data.
Findings
Method outperforms competitors in simulations.
Identifies key protein combinations across cancer types.
Improves survival prediction on external datasets.
Abstract
We propose a constrained maximum partial likelihood estimator for dimension reduction in integrative (e.g., pan-cancer) survival analysis with high-dimensional covariates. We assume that for each population in the study, the hazard function follows a distinct Cox proportional hazards model. To borrow information across populations, we assume that all of the hazard functions depend only on a small number of linear combinations of the predictors. We estimate these linear combinations using an algorithm based on "distance-to-set" penalties. This allows us to impose both low-rankness and sparsity. We derive asymptotic results which reveal that our regression coefficient estimator is more efficient than fitting a separate proportional hazards model for each population. Numerical experiments suggest that our method outperforms related competitors under various data generating models. We use…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Genetic factors in colorectal cancer · Gene expression and cancer classification
