Comparing estimators of discriminative performance of time-to-event models
Ying Jin, Andrew Leroux

TL;DR
This paper compares semi-parametric and non-parametric estimators of discriminative performance in time-to-event models, revealing biases in semi-parametric estimators that can lead to overly optimistic out-of-sample performance assessments.
Contribution
It uncovers a previously unknown bias in semi-parametric estimators and proposes smoothing techniques to improve non-parametric estimator stability.
Findings
Semi-parametric estimators can be overly optimistic in out-of-sample evaluation.
Non-parametric estimators are more variable but less biased.
Smoothing improves the stability of non-parametric estimators.
Abstract
Predicting the timing and occurrence of events is a major focus of data science applications, especially in the context of biomedical research. Performance for models estimating these outcomes, often referred to as time-to-event or survival outcomes, is frequently summarized using measures of discrimination, in particular time-dependent AUC and concordance. Many estimators for these quantities have been proposed which can be broadly categorized as either semi-parametric estimators or non-parametric estimators. In this paper, we review various estimators' mathematical construction and compare the behavior of the two classes of estimators. Importantly, we identify a previously unknown feature of the class of semi-parametric estimators that can result in vastly over-optimistic out-of-sample estimation of discriminative performance in common applied tasks. Although these semi-parametric…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Advanced Statistical Process Monitoring
