CK4Gen: A Knowledge Distillation Framework for Generating High-Utility Synthetic Survival Datasets in Healthcare
Nicholas I-Hsien Kuo, Blanca Gallego, Louisa Jorm

TL;DR
CK4Gen is a new framework that uses knowledge distillation from Cox models to generate high-utility synthetic survival healthcare data, overcoming limitations of existing generative models and supporting research and education.
Contribution
It introduces CK4Gen, a novel knowledge distillation approach that preserves clinical characteristics in synthetic data, improving utility over traditional generative models.
Findings
Outperforms existing methods in data realism and utility
Enhances survival model performance through data augmentation
Validated on four benchmark datasets
Abstract
Access to real clinical data is heavily restricted by privacy regulations, hindering both healthcare research and education. These constraints slow progress in developing new treatments and data-driven healthcare solutions, while also limiting students' access to real-world datasets, leaving them without essential practical skills. High-utility synthetic datasets are therefore critical for advancing research and providing meaningful training material. However, current generative models -- such as Variational Autoencoders (VAEs) and Generative Adversarial Networks (GANs) -- produce surface-level realism at the expense of healthcare utility, blending distinct patient profiles and producing synthetic data of limited practical relevance. To overcome these limitations, we introduce CK4Gen (Cox Knowledge for Generation), a novel framework that leverages knowledge distillation from Cox…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Frailty in Older Adults
MethodsKnowledge Distillation
