Statistical inference after variable selection in Cox models: A simulation study
Lena Schemet, Sarah Friedrich-Welz

TL;DR
This paper evaluates various post-variable selection inference methods for Cox models in survival analysis, highlighting their performance and biases through simulations and a real data example.
Contribution
It compares multiple inference procedures after variable selection in Cox models, providing insights into their effectiveness in realistic biomedical scenarios.
Findings
Debiased Lasso provides less biased estimates.
Sample splitting reduces overfitting but may decrease power.
Exact post-selection inference offers valid confidence intervals.
Abstract
Choosing relevant predictors is central to the analysis of biomedical time-to-event data. Classical frequentist inference, however, presumes that the set of covariates is fixed in advance and does not account for data-driven variable selection. As a consequence, naive post-selection inference may be biased and misleading. In right-censored survival settings, these issues may be further exacerbated by the additional uncertainty induced by censoring. We investigate several inference procedures applied after variable selection for the coefficients of the Lasso and its extension, the adaptive Lasso, in the context of the Cox model. The methods considered include sample splitting, exact post-selection inference, and the debiased Lasso. Their performance is examined in a neutral simulation study reflecting realistic covariate structures and censoring rates commonly encountered in biomedical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Advanced Causal Inference Techniques
