When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systems

Shesh Narayan Gupta; Nik Bear Brown

arXiv:2603.16134·cs.CV·March 18, 2026

When Generative Augmentation Hurts: A Benchmark Study of GAN and Diffusion Models for Bias Correction in AI Classification Systems

Shesh Narayan Gupta, Nik Bear Brown

PDF

Open Access

TL;DR

This study benchmarks generative augmentation methods for bias correction in AI classification, revealing that certain GANs can worsen bias under low-data conditions, while diffusion models perform better.

Contribution

It provides a controlled comparison of augmentation strategies, highlighting the risks of GANs and the effectiveness of diffusion models with LoRA in bias mitigation.

Findings

01

FastGAN increases classifier bias under low-data conditions

02

Stable Diffusion with LoRA reduces bias and improves F1 score

03

GAN augmentation can be harmful below 20-50 training images per class

Abstract

Generative models are widely used to compensate for class imbalance in AI training pipelines, yet their failure modes under low-data conditions are poorly understood. This paper reports a controlled benchmark comparing three augmentation strategies applied to a fine-grained animal classification task: traditional transforms, FastGAN, and Stable Diffusion 1.5 fine-tuned with Low-Rank Adaptation (LoRA). Using the Oxford-IIIT Pet Dataset with eight artificially underrepresented breeds, we find that FastGAN augmentation does not merely underperform at very low training set sizes but actively increases classifier bias, with a statistically significant large effect across three random seeds (bias gap increase: +20.7%, Cohen's d = +5.03, p = 0.013). The effect size here is large enough to give confidence in the direction of the finding despite the small number of seeds. Feature embedding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning