A Reinforcement Learning Approach to Synthetic Data Generation
Natalia Espinosa-Dice, Nicholas J. Jackson, Chao Yan, Aaron Lee, Bradley A. Malin

TL;DR
This paper introduces RLSyn, a reinforcement learning framework for generating synthetic biomedical data that outperforms traditional models in fidelity, utility, and privacy, especially in small-sample settings.
Contribution
The paper presents RLSyn, a novel RL-based method for synthetic data generation that improves over GANs and diffusion models in biomedical applications.
Findings
RLSyn achieves comparable utility to diffusion models on MIMIC-IV.
RLSyn outperforms diffusion models in fidelity and privacy risk.
Both RLSyn and diffusion models outperform GANs in utility and fidelity.
Abstract
Synthetic data generation (SDG) is a promising approach for enabling data sharing in biomedical studies while preserving patient privacy. Yet, state-of-the-art generative models often require large datasets and complex training procedures, limiting their applicability in small-sample settings common in biomedical research. This study aims to develop a more principled and efficient approach to SDG and evaluate its efficacy for biomedical applications. In this work, we reframe SDG as a reinforcement learning (RL) problem and introduce RLSyn, a novel framework that models the data generator as a stochastic policy over patient records and optimizes it using Proximal Policy Optimization with discriminator-derived rewards. We evaluate RLSyn on two biomedical datasets--AI-READI and MIMIC-IV--and benchmark it against state-of-the-art generative adversarial networks (GANs) and diffusion-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Machine Learning in Healthcare · Generative Adversarial Networks and Image Synthesis
