BigSurvSGD: Big Survival Data Analysis via Stochastic Gradient Descent
Aliasghar Tarkhan, Noah Simon

TL;DR
This paper introduces BigSurvSGD, a novel stochastic gradient descent-based method for efficiently fitting proportional hazards models on large-scale survival datasets, enabling scalable and complex survival analysis.
Contribution
The paper proposes a new formulation of proportional hazards regression that is compatible with stochastic gradient descent, addressing computational challenges in large datasets.
Findings
Efficiently fits survival models on very large datasets.
Enables training complex neural network-based survival models.
Improves computational stability and scalability.
Abstract
In many biomedical applications, outcome is measured as a ``time-to-event'' (eg. disease progression or death). To assess the connection between features of a patient and this outcome, it is common to assume a proportional hazards model, and fit a proportional hazards regression (or Cox regression). To fit this model, a log-concave objective function known as the ``partial likelihood'' is maximized. For moderate-sized datasets, an efficient Newton-Raphson algorithm that leverages the structure of the objective can be employed. However, in large datasets this approach has two issues: 1) The computational tricks that leverage structure can also lead to computational instability; 2) The objective does not naturally decouple: Thus, if the dataset does not fit in memory, the model can be very computationally expensive to fit. This additionally means that the objective is not directly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and Algorithms · Sparse and Compressive Sensing Techniques
