Assessing variable importance in survival analysis using machine learning
Charles J. Wolock, Peter B. Gilbert, Noah Simon, Marco Carone

TL;DR
This paper introduces a flexible, efficient, and robust method for assessing the importance of variables in survival analysis models, addressing the challenge of right-censored data and providing valid inference.
Contribution
It proposes a broad class of algorithm-agnostic importance measures with a nonparametric estimation procedure that is double-robust and suitable for survival data.
Findings
Method performs well in simulations
Provides valid inference for variable importance
Applied to HIV vaccine trial data
Abstract
Given a collection of features available for inclusion in a predictive model, it may be of interest to quantify the relative importance of a subset of features for the prediction task at hand. For example, in HIV vaccine trials, participant baseline characteristics are used to predict the probability of HIV acquisition over the intended follow-up period, and investigators may wish to understand how much certain types of predictors, such as behavioral factors, contribute toward overall predictiveness. Time-to-event outcomes such as time to HIV acquisition are often subject to right censoring, and existing methods for assessing variable importance are typically not intended to be used in this setting. We describe a broad class of algorithm-agnostic variable importance measures for prediction in the context of survival data. We propose a nonparametric efficient estimation procedure that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · HIV Research and Treatment
