Improved Non-parametric Penalized Maximum Likelihood Estimation for Arbitrarily Censored Survival Data
Justin D. Tubbs, Lane Guolan Chen, Thuan Quoc Thach, Pak C. Sham

TL;DR
This paper introduces a kernel smoothing enhancement to non-parametric maximum likelihood estimators for censored survival data, significantly reducing overfitting and improving accuracy in survival function estimation and prediction.
Contribution
It proposes a novel smoothing approach with a BIC-type criterion for censored data, along with an optimization algorithm, improving existing survival analysis methods.
Findings
Reduces discrepancy between estimated and true survival functions by up to 49%.
Improves within-sample and out-of-sample prediction accuracy by up to 41% and 23%.
Demonstrates effectiveness on real breast cancer censored data.
Abstract
Non-parametric maximum likelihood estimation encompasses a group of classic methods to estimate distribution-associated functions from potentially censored and truncated data, with extensive applications in survival analysis. These methods, including the Kaplan-Meier estimator and Turnbull's method, often result in overfitting, especially when the sample size is small. We propose an improvement to these methods by applying kernel smoothing to their raw estimates, based on a BIC-type loss function that balances the trade-off between optimizing model fit and controlling model complexity. In the context of a longitudinal study with repeated observations, we detail our proposed smoothing procedure and optimization algorithm. With extensive simulation studies over multiple realistic scenarios, we demonstrate that our smoothing-based procedure provides better overall accuracy in both survival…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Statistical Methods and Bayesian Inference · Colorectal Cancer Screening and Detection
