A scalable and flexible Cox proportional hazards model for high-dimensional survival prediction and functional selection
Boyi Guo, Nengjun Yi

TL;DR
This paper introduces a scalable, flexible additive Cox proportional hazards model with a novel spike-and-slab LASSO prior for high-dimensional survival data, enabling effective functional selection and improved prediction.
Contribution
It develops a new high-dimensional Cox model with bi-level functional selection using spike-and-slab LASSO, and proposes an efficient EM-Coordinate Descent algorithm for scalable fitting.
Findings
Outperforms existing models in simulation studies.
Demonstrates effective functional selection in metabolomics data.
Provides a broadly applicable R package for practical use.
Abstract
Cox proportional hazards model is one of the most popular models in biomedical data analysis. There have been continuing efforts to improve the flexibility of such models for complex signal detection, for example, via additive functions. Nevertheless, the task to extend Cox additive models to accommodate high-dimensional data is nontrivial. When estimating additive functions, commonly used group sparse regularization may introduce excess smoothing shrinkage on additive functions, damaging predictive performance. Moreover, an "all-in-all-out" approach makes functional selection challenging to answer if nonlinear effects exist. We develop an additive Cox PH model to address these challenges in high-dimensional data analysis. Notably, we impose a novel spike-and-slab LASSO prior that motivates the bi-level functional selection on additive functions. A scalable and deterministic algorithm,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Gene expression and cancer classification · Bioinformatics and Genomic Networks
