InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models
Min Hou, Yueying Wu, Chang Xu, Yu-Hao Huang, Chenxi Bai, Le Wu, Jiang, Bian

TL;DR
InvDiff is a novel framework that learns invariant semantic features to guide diffusion models, effectively reducing biases without needing bias labels, while preserving high-quality image synthesis.
Contribution
Proposes InvDiff, a lightweight debiasing method that learns invariant features for diffusion models without bias annotations, with theoretical guarantees and empirical effectiveness.
Findings
Reduces biases in diffusion models on multiple benchmarks.
Maintains high image quality during bias mitigation.
Operates with minimal additional parameters.
Abstract
As one of the most successful generative models, diffusion models have demonstrated remarkable efficacy in synthesizing high-quality images. These models learn the underlying high-dimensional data distribution in an unsupervised manner. Despite their success, diffusion models are highly data-driven and prone to inheriting the imbalances and biases present in real-world data. Some studies have attempted to address these issues by designing text prompts for known biases or using bias labels to construct unbiased data. While these methods have shown improved results, real-world scenarios often contain various unknown biases, and obtaining bias labels is particularly challenging. In this paper, we emphasize the necessity of mitigating bias in pre-trained diffusion models without relying on auxiliary bias annotations. To tackle this problem, we propose a framework, InvDiff, which aims to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNuclear reactor physics and engineering · Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics
MethodsDiffusion
