Rethinking Bias in Generative Data Augmentation for Medical AI: a Frequency Recalibration Method
Chi Liu, Jincheng Liu, Congcong Zhu, Minghao Wang, Sheng Shen, Jia Gu, Tianqing Zhu, Wanlei Zhou

TL;DR
This paper introduces FreRec, a frequency recalibration method that reduces bias in generative data augmentation for medical AI, significantly improving classification performance across various medical imaging datasets.
Contribution
The paper proposes a novel frequency recalibration technique, FreRec, to address bias in GDA, enhancing the quality and reliability of synthetic medical images for AI applications.
Findings
FreRec reduces frequency distribution mismatch between real and synthetic images.
FreRec improves downstream classification accuracy across multiple medical datasets.
FreRec is compatible with any generative model and easy to integrate.
Abstract
Developing Medical AI relies on large datasets and easily suffers from data scarcity. Generative data augmentation (GDA) using AI generative models offers a solution to synthesize realistic medical images. However, the bias in GDA is often underestimated in medical domains, with concerns about the risk of introducing detrimental features generated by AI and harming downstream tasks. This paper identifies the frequency misalignment between real and synthesized images as one of the key factors underlying unreliable GDA and proposes the Frequency Recalibration (FreRec) method to reduce the frequency distributional discrepancy and thus improve GDA. FreRec involves (1) Statistical High-frequency Replacement (SHR) to roughly align high-frequency components and (2) Reconstructive High-frequency Mapping (RHM) to enhance image quality and reconstruct high-frequency details. Extensive experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Artificial Intelligence in Healthcare and Education · Domain Adaptation and Few-Shot Learning
