Random survival forests with multivariate longitudinal endogenous covariates
Anthony Devaux (BPH), Catherine Helmer (BPH), Robin Genuer (SISTM,, BPH), C\'ecile Proust-Lima (BPH)

TL;DR
This paper introduces DynForest, a novel random survival forest extension that predicts individual event risks using numerous longitudinal endogenous predictors, outperforming traditional models in complex, high-dimensional clinical data scenarios.
Contribution
DynForest provides an innovative method to handle many longitudinal predictors in survival analysis, overcoming limitations of joint models and calibration methods.
Findings
DynForest performs well in both small and large dimensional settings.
It accurately predicts dementia risk using diverse longitudinal markers.
The method identifies key predictors influencing event risk.
Abstract
Predicting the individual risk of a clinical event using the complete patient history is still a major challenge for personalized medicine. Among the methods developed to compute individual dynamic predictions, the joint models have the assets of using all the available information while accounting for dropout. However, they are restricted to a very small number of longitudinal predictors. Our objective was to propose an innovative alternative solution to predict an event probability using a possibly large number of longitudinal predictors. We developed DynForest, an extension of competing-risk random survival forests that handles endogenous longitudinal predictors. At each node of the tree, the time-dependent predictors are translated into time-fixed features (using mixed models) to be used as candidates for splitting the subjects into two subgroups. The individual event probability is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Machine Learning in Healthcare · Bayesian Methods and Mixture Models
