Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large   Language Models

Donggeun Ko; Dongjun Lee; Namjun Park; Wonkyeong Shim; Jaekwang Kim

arXiv:2411.16079·cs.CV·November 26, 2024

Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models

Donggeun Ko, Dongjun Lee, Namjun Park, Wonkyeong Shim, Jaekwang Kim

PDF

Open Access

TL;DR

DiffuBias is a novel method that uses pretrained diffusion models to generate bias-conflict samples, improving classifier robustness and generalization without additional training, and achieves state-of-the-art results in debiasing tasks.

Contribution

This paper introduces DiffuBias, the first approach leveraging stable diffusion models to generate bias-conflict samples for debiasing classifiers.

Findings

01

Achieves state-of-the-art debiasing performance on benchmark datasets.

02

Effectively enhances classifier robustness without additional training.

03

Demonstrates computational efficiency through energy consumption analysis.

Abstract

Neural networks struggle with image classification when biases are learned and misleads correlations, affecting their generalization and performance. Previous methods require attribute labels (e.g. background, color) or utilizes Generative Adversarial Networks (GANs) to mitigate biases. We introduce DiffuBias, a novel pipeline for text-to-image generation that enhances classifier robustness by generating bias-conflict samples, without requiring training during the generation phase. Utilizing pretrained diffusion and image captioning models, DiffuBias generates images that challenge the biases of classifiers, using the top- $K$ losses from a biased classifier ( $f_{B}$ ) to create more representative data samples. This method not only debiases effectively but also boosts classifier generalization capabilities. To the best of our knowledge, DiffuBias is the first approach leveraging a stable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsDiffusion