Evaluating and Improving the Effectiveness of Synthetic Chest X-Rays for Medical Image Analysis
Eva Prakash, Jeya Maria Jose Valanarasu, Zhihong Chen, Eduardo Pontes Reis, Andrew Johnston, Anuj Pareek, Christian Bluethgen, Sergios Gatidis, Cameron Olsen, Akshay Chaudhari, Andrew Ng, Curtis Langlotz

TL;DR
This study investigates methods to generate and utilize synthetic chest X-ray images to enhance deep learning model performance in medical image classification and segmentation tasks, demonstrating significant improvements through optimized synthetic data augmentation.
Contribution
It introduces a latent diffusion model conditioned on text and segmentation masks for synthetic X-ray generation, and evaluates best practices for dataset augmentation in medical imaging.
Findings
Synthetic data improved classification F1 scores by up to 0.15
Segmentation Dice scores increased by up to 0.146
Using proxy models and radiologist feedback enhances synthetic image quality
Abstract
Purpose: To explore best-practice approaches for generating synthetic chest X-ray images and augmenting medical imaging datasets to optimize the performance of deep learning models in downstream tasks like classification and segmentation. Materials and Methods: We utilized a latent diffusion model to condition the generation of synthetic chest X-rays on text prompts and/or segmentation masks. We explored methods like using a proxy model and using radiologist feedback to improve the quality of synthetic data. These synthetic images were then generated from relevant disease information or geometrically transformed segmentation masks and added to ground truth training set images from the CheXpert, CANDID-PTX, SIIM, and RSNA Pneumonia datasets to measure improvements in classification and segmentation model performance on the test sets. F1 and Dice scores were used to evaluate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · COVID-19 diagnosis using AI · Advanced X-ray and CT Imaging
MethodsLatent Diffusion Model · Sparse Evolutionary Training · Diffusion
