Predicting Survival Outcomes in the Presence of Unlabeled Data
Fateme Nateghi Haredasht, Celine Vens

TL;DR
This paper explores leveraging unlabeled patient data in survival analysis to improve prediction accuracy, introducing a third supervision level and comparing three methods across multiple datasets.
Contribution
It introduces a novel semi-supervised approach incorporating unlabeled data into survival analysis, enhancing predictive performance over traditional methods.
Findings
All proposed methods improved prediction accuracy.
Semi-supervised wrapper approach often yielded the best results.
Including unlabeled data significantly benefits survival time predictions.
Abstract
Many clinical studies require the follow-up of patients over time. This is challenging: apart from frequently observed drop-out, there are often also organizational and financial challenges, which can lead to reduced data collection and, in turn, can complicate subsequent analyses. In contrast, there is often plenty of baseline data available of patients with similar characteristics and background information, e.g., from patients that fall outside the study time window. In this article, we investigate whether we can benefit from the inclusion of such unlabeled data instances to predict accurate survival times. In other words, we introduce a third level of supervision in the context of survival analysis, apart from fully observed and censored instances, we also include unlabeled instances. We propose three approaches to deal with this novel setting and provide an empirical comparison…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Causal Inference Techniques · Statistical Methods in Clinical Trials · Statistical Methods and Inference
MethodsTest
