Synthetic Observational Health Data with GANs: from slow adoption to a boom in medical research and ultimately digital twins?
Jeremy Georges-Filteau, Elisa Cirillo

TL;DR
This paper reviews the use of GANs for generating synthetic observational health data, highlighting challenges, recent growth in research, and potential to revolutionize medical data sharing and digital twin applications.
Contribution
It provides a comprehensive review of GAN applications to OHD, identifies challenges in data properties and evaluation, and discusses recent research trends and future directions.
Findings
Growth in publications since 2017
Challenges in data evaluation and benchmarking
Potential for privacy-preserving data sharing
Abstract
After being collected for patient care, Observational Health Data (OHD) can further benefit patient well-being by sustaining the development of health informatics and medical research. Vast potential is unexploited because of the fiercely private nature of patient-related data and regulations to protect it. Generative Adversarial Networks (GANs) have recently emerged as a groundbreaking way to learn generative models that produce realistic synthetic data. They have revolutionized practices in multiple domains such as self-driving cars, fraud detection, digital twin simulations in industrial sectors, and medical imaging. The digital twin concept could readily apply to modelling and quantifying disease progression. In addition, GANs posses many capabilities relevant to common problems in healthcare: lack of data, class imbalance, rare diseases, and preserving privacy. Unlocking open…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
