Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models
Donggeun Ko, Dongjun Lee, Namjun Park, Wonkyeong Shim, Jaekwang Kim

TL;DR
DiffuBias is a novel method that uses pretrained diffusion models to generate bias-conflict samples, improving classifier robustness and generalization without additional training, and achieves state-of-the-art results in debiasing tasks.
Contribution
This paper introduces DiffuBias, the first approach leveraging stable diffusion models to generate bias-conflict samples for debiasing classifiers.
Findings
Achieves state-of-the-art debiasing performance on benchmark datasets.
Effectively enhances classifier robustness without additional training.
Demonstrates computational efficiency through energy consumption analysis.
Abstract
Neural networks struggle with image classification when biases are learned and misleads correlations, affecting their generalization and performance. Previous methods require attribute labels (e.g. background, color) or utilizes Generative Adversarial Networks (GANs) to mitigate biases. We introduce DiffuBias, a novel pipeline for text-to-image generation that enhances classifier robustness by generating bias-conflict samples, without requiring training during the generation phase. Utilizing pretrained diffusion and image captioning models, DiffuBias generates images that challenge the biases of classifiers, using the top- losses from a biased classifier () to create more representative data samples. This method not only debiases effectively but also boosts classifier generalization capabilities. To the best of our knowledge, DiffuBias is the first approach leveraging a stable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsDiffusion
