Deeper Diffusion Models Amplify Bias
Shahin Hakemi, Naveed Akhtar, Ghulam Mubashar Hassan, Ajmal Mian

TL;DR
This paper investigates how deep diffusion models can amplify biases present in training data and potentially compromise privacy, highlighting risks associated with their internal workings and the bias-variance tradeoff.
Contribution
It provides a systematic theoretical and empirical analysis of bias amplification and privacy risks in deep diffusion models, expanding understanding beyond traditional generalization.
Findings
Deeper diffusion models tend to amplify training data bias.
Diffusion models can compromise training sample privacy.
Theoretical and empirical validation of bias and privacy risks.
Abstract
Despite the remarkable performance of generative Diffusion Models (DMs), their internal working is still not well understood, which is potentially problematic. This paper focuses on exploring the important notion of bias-variance tradeoff in diffusion models. Providing a systematic foundation for this exploration, it establishes that at one extreme, the diffusion models may amplify the inherent bias in the training data, and on the other, they may compromise the presumed privacy of the training samples. Our exploration aligns with the memorization-generalization understanding of the generative models, but it also expands further along this spectrum beyond "generalization", revealing the risk of bias amplification in deeper models. Our claims are validated both theoretically and empirically.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
