BigSurvSGD: Big Survival Data Analysis via Stochastic Gradient Descent

Aliasghar Tarkhan; Noah Simon

arXiv:2003.00116·math.ST·August 11, 2020·5 cites

BigSurvSGD: Big Survival Data Analysis via Stochastic Gradient Descent

Aliasghar Tarkhan, Noah Simon

PDF

Open Access

TL;DR

This paper introduces BigSurvSGD, a novel stochastic gradient descent-based method for efficiently fitting proportional hazards models on large-scale survival datasets, enabling scalable and complex survival analysis.

Contribution

The paper proposes a new formulation of proportional hazards regression that is compatible with stochastic gradient descent, addressing computational challenges in large datasets.

Findings

01

Efficiently fits survival models on very large datasets.

02

Enables training complex neural network-based survival models.

03

Improves computational stability and scalability.

Abstract

In many biomedical applications, outcome is measured as a ``time-to-event'' (eg. disease progression or death). To assess the connection between features of a patient and this outcome, it is common to assume a proportional hazards model, and fit a proportional hazards regression (or Cox regression). To fit this model, a log-concave objective function known as the ``partial likelihood'' is maximized. For moderate-sized datasets, an efficient Newton-Raphson algorithm that leverages the structure of the objective can be employed. However, in large datasets this approach has two issues: 1) The computational tricks that leverage structure can also lead to computational instability; 2) The objective does not naturally decouple: Thus, if the dataset does not fit in memory, the model can be very computationally expensive to fit. This additionally means that the objective is not directly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Sparse and Compressive Sensing Techniques