Enabling Granular Subgroup Level Model Evaluations by Generating Synthetic Medical Time Series
Mahmoud Ibrahim, Bart Elen, Chang Sun, G\"okhan Ertaylan, Michel Dumontier

TL;DR
This paper introduces an advanced synthetic data generation framework that improves the evaluation of predictive models in ICU settings, especially for detailed demographic subgroups, ensuring trustworthiness and privacy preservation.
Contribution
We propose Enhanced TimeAutoDiff, a novel synthetic data generator that significantly improves subgroup evaluation accuracy and reduces evaluation gaps compared to prior methods.
Findings
Enhanced TimeAutoDiff reduces TRTS gap by over 70%
Synthetic cohorts improve subgroup AUROC estimation accuracy by up to 50%
Our approach outperforms real data in most subgroups in 72-84% of cases
Abstract
We present a novel framework for leveraging synthetic ICU time-series data not only to train but also to rigorously and trustworthily evaluate predictive models, both at the population level and within fine-grained demographic subgroups. Building on prior diffusion and VAE-based generators (TimeDiff, HealthGen, TimeAutoDiff), we introduce \textit{Enhanced TimeAutoDiff}, which augments the latent diffusion objective with distribution-alignment penalties. We extensively benchmark all models on MIMIC-III and eICU, on 24-hour mortality and binary length-of-stay tasks. Our results show that Enhanced TimeAutoDiff reduces the gap between real-on-synthetic and real-on-real evaluation (``TRTS gap'') by over 70\%, achieving AUROC, while preserving training utility (). Crucially, for 32 intersectional subgroups, large synthetic cohorts cut…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Sepsis Diagnosis and Treatment · Artificial Intelligence in Healthcare and Education
