MedEqualizer: A Framework Investigating Bias in Synthetic Medical Data and Mitigation via Augmentation
Sama Salarian, Yue Zhang, Swati Padhee, Srinivasan Parthasarathy

TL;DR
This paper evaluates bias in synthetic medical data generated by GANs, introduces MedEqualizer to improve demographic fairness, and demonstrates its effectiveness in creating more balanced healthcare datasets.
Contribution
The study presents MedEqualizer, a novel augmentation framework that reduces demographic disparities in synthetic healthcare data generated by GANs.
Findings
Significant subgroup imbalances exist in synthetic data.
MedEqualizer effectively improves demographic balance.
Enhanced fairness supports more equitable healthcare research.
Abstract
Synthetic healthcare data generation presents a viable approach to enhance data accessibility and support research by overcoming limitations associated with real-world medical datasets. However, ensuring fairness across protected attributes in synthetic data is critical to avoid biased or misleading results in clinical research and decision-making. In this study, we assess the fairness of synthetic data generated by multiple generative adversarial network (GAN)-based models using the MIMIC-III dataset, with a focus on representativeness across protected demographic attributes. We measure subgroup representation using the logarithmic disparity metric and observe significant imbalances, with many subgroups either underrepresented or overrepresented in the synthetic data, compared to the real data. To mitigate these disparities, we introduce MedEqualizer, a model-agnostic augmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Privacy-Preserving Technologies in Data · Artificial Intelligence in Healthcare and Education
