Unbiased split variable selection for random survival forests using maximally selected rank statistics
Marvin N. Wright, Theresa Dankowski, Andreas Ziegler

TL;DR
This paper introduces an unbiased split variable selection method for random survival forests using maximally selected rank statistics, improving detection of non-linear effects and reducing bias, with better prediction performance and faster computation.
Contribution
The paper proposes a novel split criterion for random survival forests based on maximally selected rank statistics, addressing bias and non-linear effect detection.
Findings
Unbiased split variable selection is achievable with the new method.
The method outperforms existing random survival forests in prediction accuracy.
It is computationally faster when using simple p-value approximations.
Abstract
The most popular approach for analyzing survival data is the Cox regression model. The Cox model may, however, be misspecified, and its proportionality assumption may not always be fulfilled. An alternative approach for survival prediction is random forests for survival outcomes. The standard split criterion for random survival forests is the log-rank test statistics, which favors splitting variables with many possible split points. Conditional inference forests avoid this split variable selection bias. However, linear rank statistics are utilized by default in conditional inference forests to select the optimal splitting variable, which cannot detect non-linear effects in the independent variables. An alternative is to use maximally selected rank statistics for the split point selection. As in conditional inference forests, splitting variables are compared on the p-value scale.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
