Synthetically Enhanced: Unveiling Synthetic Data's Potential in Medical Imaging Research
Bardia Khosravi, Frank Li, Theo Dapamede, Pouria Rouzrokh, Cooper U., Gamble, Hari M. Trivedi, Cody C. Wyles, Andrew B. Sellergren, Saptarshi, Purkayastha, Bradley J. Erickson, Judy W. Gichoya

TL;DR
This study demonstrates that using synthetic chest X-ray images generated by diffusion models can significantly enhance the accuracy and generalizability of diagnostic classifiers across diverse datasets.
Contribution
The paper introduces a novel approach of using denoising diffusion probabilistic models to generate synthetic medical images for data augmentation in medical imaging research.
Findings
Synthetic data increased AUROC by up to 0.02.
Classifiers trained solely on synthetic data achieved comparable performance to real data.
Combining synthetic and real data improved model generalizability from 0.76 to 0.80 AUROC.
Abstract
Chest X-rays (CXR) are essential for diagnosing a variety of conditions, but when used on new populations, model generalizability issues limit their efficacy. Generative AI, particularly denoising diffusion probabilistic models (DDPMs), offers a promising approach to generating synthetic images, enhancing dataset diversity. This study investigates the impact of synthetic data supplementation on the performance and generalizability of medical imaging research. The study employed DDPMs to create synthetic CXRs conditioned on demographic and pathological characteristics from the CheXpert dataset. These synthetic images were used to supplement training datasets for pathology classifiers, with the aim of improving their performance. The evaluation involved three datasets (CheXpert, MIMIC-CXR, and Emory Chest X-ray) and various experiments, including supplementing real data with synthetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neuroimaging Techniques and Applications · Radiomics and Machine Learning in Medical Imaging · MRI in cancer diagnosis
MethodsSparse Evolutionary Training · Diffusion
