Accelerated and interpretable oblique random survival forests
Byron C. Jaeger, Sawyer Welden, Kristin Lenoir, Jaime L. Speiser,, Matthew W. Segar, Ambarish Pandey, Nicholas M. Pajewski

TL;DR
This paper presents a faster, more interpretable oblique random survival forest method that improves computational efficiency and predictor importance estimation, outperforming existing approaches in accuracy and speed.
Contribution
It introduces a novel Newton-Raphson based optimization for oblique RSFs and a new predictor importance measure, enhancing speed and interpretability.
Findings
Oblique RSF is approximately 450 times faster with similar or better accuracy.
The new importance measure reliably identifies relevant predictors.
Methods are implemented in the aorsf R package.
Abstract
The oblique random survival forest (RSF) is an ensemble supervised learning method for right-censored outcomes. Trees in the oblique RSF are grown using linear combinations of predictors to create branches, whereas in the standard RSF, a single predictor is used. Oblique RSF ensembles often have higher prediction accuracy than standard RSF ensembles. However, assessing all possible linear combinations of predictors induces significant computational overhead that limits applications to large-scale data sets. In addition, few methods have been developed for interpretation of oblique RSF ensembles, and they remain more difficult to interpret compared to their axis-based counterparts. We introduce a method to increase computational efficiency of the oblique RSF and a method to estimate importance of individual predictor variables with the oblique RSF. Our strategy to reduce computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Analysis with R · Forest ecology and management · Statistical Methods and Inference
