FairQueue: Rethinking Prompt Learning for Fair Text-to-Image Generation
Christopher T.H Teo, Milad Abdollahzadeh, Xinda Ma, Ngai-man Cheung

TL;DR
FairQueue critically examines prompt learning for fair text-to-image generation, identifying its limitations in sample quality, and introduces novel analysis and techniques to improve image quality while maintaining fairness.
Contribution
This work reveals the limitations of current prompt learning methods, introduces new analysis tools for cross-attention maps, and proposes two techniques to enhance image quality in fair T2I generation.
Findings
Outperforms SOTA in image quality while maintaining fairness
Identifies abnormalities in early denoising steps affecting output quality
Proposes Prompt Queuing and Attention Amplification to improve results
Abstract
Recently, prompt learning has emerged as the state-of-the-art (SOTA) for fair text-to-image (T2I) generation. Specifically, this approach leverages readily available reference images to learn inclusive prompts for each target Sensitive Attribute (tSA), allowing for fair image generation. In this work, we first reveal that this prompt learning-based approach results in degraded sample quality. Our analysis shows that the approach's training objective -- which aims to align the embedding differences of learned prompts and reference images -- could be sub-optimal, resulting in distortion of the learned prompts and degraded generated images. To further substantiate this claim, as our major contribution, we deep dive into the denoising subnetwork of the T2I model to track down the effect of these learned prompts by analyzing the cross-attention maps. In our analysis, we propose a novel…
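The abstract describes the prior approach's training objective only as aligning the embedding differences of learned prompts with those of reference images. A minimal sketch of one way such an objective could look is given below. The cosine-based loss, the function names, and the use of plain Python lists as embeddings are all assumptions made for illustration, not the authors' implementation.

```python
import math


def subtract(a, b):
    """Element-wise difference of two equal-length vectors."""
    return [x - y for x, y in zip(a, b)]


def cosine_similarity(u, v):
    """Cosine of the angle between u and v; 0.0 if either vector is zero."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def direction_alignment_loss(prompt_emb_a, prompt_emb_b, image_emb_a, image_emb_b):
    """Hypothetical loss: 1 - cos(prompt difference, image difference).

    prompt_emb_a / prompt_emb_b: embeddings of learned prompts for two
        categories of a target sensitive attribute (assumed inputs).
    image_emb_a / image_emb_b: embeddings of reference images for the
        same two categories (assumed inputs).
    The loss is 0 when the two difference vectors point the same way
    and 2 when they point in opposite directions.
    """
    prompt_diff = subtract(prompt_emb_a, prompt_emb_b)
    image_diff = subtract(image_emb_a, image_emb_b)
    return 1.0 - cosine_similarity(prompt_diff, image_diff)
```

Because a loss of this kind constrains only the direction of the difference between prompts, it leaves the individual prompt embeddings otherwise free, which is consistent with the abstract's concern that the learned prompts may end up distorted.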
Taxonomy
Topics: Artificial Intelligence in Law · Law in Society and Culture · Ethics and Social Impacts of AI
Methods: Softmax · Attention Is All You Need · ALIGN
