Should univariate Cox regression be used for feature selection with respect to time-to-event outcomes?
Rong Lu (The Quantitative Sciences Unit, Division of Biomedical, Informatics Research, Department of Medicine, Stanford University)

TL;DR
This study compares Cox regression and Gaussian regression of log-transformed survival time for feature selection in time-to-event data, finding Gaussian regression often outperforms Cox models in simulation scenarios.
Contribution
It introduces the use of Gaussian regression of log-transformed survival time as a superior alternative for feature selection over traditional Cox models.
Findings
Gaussian regression outperforms Cox in sensitivity and effect size ranking
Log-transformed Gaussian regression maintains high accuracy across scenarios
Cox models are less effective for feature selection in simulated time-to-event data
Abstract
IMPORTANCE: Time-to-event outcomes are commonly used in clinical trials and biomarker discovery studies and have been primarily analyzed using Cox proportional hazards models. But it's unclear which statistical models should be recommended for feature selection tasks when time-to-event outcomes are of the primary interest. OBJECTIVE: To explore if Gaussian regression of log-transformed survival time could outperform Cox proportional hazards models in feature selection. DESIGN: In this simulation study, the true models are multivariate Cox proportional hazards models with 10 covariates. For all feature selection comparisons, it's assumed that only 5 out the 10 true features are observed/measured for all model fitting, along with 5 random noise features. Each sample size and censoring rate scenario is explored using 10,000 simulation datasets. Different statistical models are applied to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Computational Drug Discovery Methods · Statistical Methods and Inference
