A Reinforcement Learning Approach to Synthetic Data Generation

Natalia Espinosa-Dice; Nicholas J. Jackson; Chao Yan; Aaron Lee; Bradley A. Malin

arXiv:2512.21395·cs.LG·January 27, 2026

A Reinforcement Learning Approach to Synthetic Data Generation

Natalia Espinosa-Dice, Nicholas J. Jackson, Chao Yan, Aaron Lee, Bradley A. Malin

PDF

Open Access

TL;DR

This paper introduces RLSyn, a reinforcement learning framework for generating synthetic biomedical data that outperforms traditional models in fidelity, utility, and privacy, especially in small-sample settings.

Contribution

The paper presents RLSyn, a novel RL-based method for synthetic data generation that improves over GANs and diffusion models in biomedical applications.

Findings

01

RLSyn achieves comparable utility to diffusion models on MIMIC-IV.

02

RLSyn outperforms diffusion models in fidelity and privacy risk.

03

Both RLSyn and diffusion models outperform GANs in utility and fidelity.

Abstract

Synthetic data generation (SDG) is a promising approach for enabling data sharing in biomedical studies while preserving patient privacy. Yet, state-of-the-art generative models often require large datasets and complex training procedures, limiting their applicability in small-sample settings common in biomedical research. This study aims to develop a more principled and efficient approach to SDG and evaluate its efficacy for biomedical applications. In this work, we reframe SDG as a reinforcement learning (RL) problem and introduce RLSyn, a novel framework that models the data generator as a stochastic policy over patient records and optimizes it using Proximal Policy Optimization with discriminator-derived rewards. We evaluate RLSyn on two biomedical datasets--AI-READI and MIMIC-IV--and benchmark it against state-of-the-art generative adversarial networks (GANs) and diffusion-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Machine Learning in Healthcare · Generative Adversarial Networks and Image Synthesis