Predicting Survival Outcomes in the Presence of Unlabeled Data

Fateme Nateghi Haredasht; Celine Vens

arXiv:2210.13891·cs.LG·October 26, 2022

Predicting Survival Outcomes in the Presence of Unlabeled Data

Fateme Nateghi Haredasht, Celine Vens

PDF

Open Access

TL;DR

This paper explores leveraging unlabeled patient data in survival analysis to improve prediction accuracy, introducing a third supervision level and comparing three methods across multiple datasets.

Contribution

It introduces a novel semi-supervised approach incorporating unlabeled data into survival analysis, enhancing predictive performance over traditional methods.

Findings

01

All proposed methods improved prediction accuracy.

02

Semi-supervised wrapper approach often yielded the best results.

03

Including unlabeled data significantly benefits survival time predictions.

Abstract

Many clinical studies require the follow-up of patients over time. This is challenging: apart from frequently observed drop-out, there are often also organizational and financial challenges, which can lead to reduced data collection and, in turn, can complicate subsequent analyses. In contrast, there is often plenty of baseline data available of patients with similar characteristics and background information, e.g., from patients that fall outside the study time window. In this article, we investigate whether we can benefit from the inclusion of such unlabeled data instances to predict accurate survival times. In other words, we introduce a third level of supervision in the context of survival analysis, apart from fully observed and censored instances, we also include unlabeled instances. We propose three approaches to deal with this novel setting and provide an empirical comparison…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Statistical Methods in Clinical Trials · Statistical Methods and Inference

MethodsTest