Cross validating extensions of kernel, sparse or regular partial least squares regression models to censored data
Fr\'ed\'eric Bertrand, Philippe Bastien, Myriam Maumy-Bertrand

TL;DR
This paper investigates the failure of standard cross-validation methods for extended partial least squares regression models in censored data, proposes new criteria, and demonstrates improved performance through extensive simulations.
Contribution
It reveals the inadequacy of traditional cross-validation for PLS extensions in Cox models and introduces new robust criteria that enhance model selection and performance.
Findings
Standard cross-validation fails for PLS extensions in Cox models.
New criteria improve component selection and model performance.
Proposed measures provide more robust evaluation in censored data contexts.
Abstract
When cross-validating standard or extended Cox models, the commonly used criterion is the cross-validated partial loglikelihood using a naive or a van Houwelingen scheme -to make efficient use of the death times of the left out data in relation to the death times of all the data-. Quite astonishingly, we will show, using a strong simulation study involving three different data simulation algorithms, that these two cross-validation methods fail with the extensions, either straightforward or more involved ones, of partial least squares regression to the Cox model. This is quite an interesting result for at least two reasons. Firstly, several nice features of PLS based models, including regularization, interpretability of the components, missing data support, data visualization thanks to biplots of individuals and variables -and even parsimony for SPLS based models-, account for a common…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic and phenotypic traits in livestock · Optimal Experimental Design Methods · Animal Nutrition and Physiology
