A Systematic Study on Quantifying Bias in GAN-Augmented Data
Denis Liu

TL;DR
This paper systematically evaluates metrics for quantifying bias amplification in GAN-augmented data, revealing no single metric reliably measures bias across diverse image domains.
Contribution
It provides a comprehensive assessment of existing metrics for bias quantification in GAN-generated data, highlighting their limitations.
Findings
No single metric reliably quantifies bias across all image domains.
Existing metrics show inconsistent performance in measuring bias.
GANs can exacerbate biases in skewed datasets, affecting data diversity.
Abstract
Generative adversarial networks (GANs) have recently become a popular data augmentation technique used by machine learning practitioners. However, they have been shown to suffer from the so-called mode collapse failure mode, which makes them vulnerable to exacerbating biases on already skewed datasets, resulting in the generated data distribution being less diverse than the training distribution. To this end, we address the problem of quantifying the extent to which mode collapse occurs. This study is a systematic effort focused on the evaluation of state-of-the-art metrics that can potentially quantify biases in GAN-augmented data. We show that, while several such methods are available, there is no single metric that quantifies bias exacerbation reliably over the span of different image domains.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Seismic Imaging and Inversion Techniques
