Computing the Hazard Ratios Associated with Explanatory Variables Using   Machine Learning Models of Survival Data

Sameer Sundrani; James Lu

arXiv:2102.00637·cs.LG·April 6, 2021

Computing the Hazard Ratios Associated with Explanatory Variables Using Machine Learning Models of Survival Data

Sameer Sundrani, James Lu

PDF

1 Repo

TL;DR

This paper introduces a novel method to compute hazard ratios from tree-based machine learning models in survival analysis using SHAP values, enabling better risk factor identification.

Contribution

It presents a new approach to derive hazard ratios from ML models, specifically using SHAP values, which was not previously available for such models.

Findings

01

XGBoost performs comparably to CoxPH in survival prediction.

02

The method provides consistent hazard ratios across datasets.

03

Some variables showed opposite hazard ratio results between models.

Abstract

Purpose: The application of Cox Proportional Hazards (CoxPH) models to survival data and the derivation of Hazard Ratio (HR) is well established. While nonlinear, tree-based Machine Learning (ML) models have been developed and applied to the survival analysis, no methodology exists for computing HRs associated with explanatory variables from such models. We describe a novel way to compute HRs from tree-based ML models using the Shapley additive explanation (SHAP) values, which is a locally accurate and consistent methodology to quantify explanatory variables' contribution to predictions. Methods: We used three sets of publicly available survival data consisting of patients with colon, breast or pan cancer and compared the performance of CoxPH to the state-of-art ML model, XGBoost. To compute the HR for explanatory variables from the XGBoost model, the SHAP values were exponentiated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jameslu01/Compute_HazRatio_ML
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsShapley Additive Explanations