How to Simulate Realistic Survival Data? A Simulation Study to Compare Realistic Simulation Models
Maria Thurow, Ina Dormuth, Christina Sauer, Marc Ditzhaus, Markus, Pauly

TL;DR
This study compares various simulation models to generate realistic survival data for clinical trial settings, focusing on lung cancer studies, to guide researchers in choosing appropriate simulation methods.
Contribution
It evaluates and compares multiple simulation approaches using reconstructed benchmark data to recommend the most suitable models for survival data in lung cancer trials.
Findings
Kernel density estimation performs well in realism.
Fitted distributions provide accurate effect size simulations.
Case resampling is effective for preserving data properties.
Abstract
In statistics, it is important to have realistic data sets available for a particular context to allow an appropriate and objective method comparison. For many use cases, benchmark data sets for method comparison are already available online. However, in most medical applications and especially for clinical trials in oncology, there is a lack of adequate benchmark data sets, as patient data can be sensitive and therefore cannot be published. A potential solution for this are simulation studies. However, it is sometimes not clear, which simulation models are suitable for generating realistic data. A challenge is that potentially unrealistic assumptions have to be made about the distributions. Our approach is to use reconstructed benchmark data sets %can be used as a basis for the simulations, which has the following advantages: the actual properties are known and more realistic data can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Statistical Methods and Inference · Health Systems, Economic Evaluations, Quality of Life
