On the use of Harrell's C for clinical risk prediction via random survival forests
Matthias Schmid, Marvin Wright, Andreas Ziegler

TL;DR
This paper investigates replacing the log-rank split criterion with Harrell's C index in random survival forests, demonstrating improved prediction accuracy in certain biomedical scenarios through analytical and simulation studies.
Contribution
It introduces the use of Harrell's C index for node splitting in RSF, aligning the split criterion with the evaluation metric for better prediction performance.
Findings
C index-based splitting improves accuracy in high censoring scenarios.
Log-rank splitting performs better in noisy data.
Implementation available in R package ranger.
Abstract
Random survival forests (RSF) are a powerful method for risk prediction of right-censored outcomes in biomedical research. RSF use the log-rank split criterion to form an ensemble of survival trees. The most common approach to evaluate the prediction accuracy of a RSF model is Harrell's concordance index for survival data ('C index'). Conceptually, this strategy implies that the split criterion in RSF is different from the evaluation criterion of interest. This discrepancy can be overcome by using Harrell's C for both node splitting and evaluation. We compare the difference between the two split criteria analytically and in simulation studies with respect to the preference of more unbalanced splits, termed end-cut preference (ECP). Specifically, we show that the log-rank statistic has a stronger ECP compared to the C index. In simulation studies and with the help of two medical data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Causal Inference Techniques · Genetic Associations and Epidemiology
