The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention
Yixin Wan, Di Wu, Haoran Wang, Kai-Wei Chang

TL;DR
This paper introduces DoFaiR, a benchmark to measure the factuality trade-offs in diversity-promoting text-to-image generation, and proposes FAI, a fact-reflection method to improve demographic accuracy without sacrificing diversity.
Contribution
It presents a systematic benchmark for evaluating factuality in diversity interventions and introduces FAI, a novel fact-augmented approach to enhance demographic accuracy in T2I models.
Findings
Diversity prompts increase demographic diversity but reduce factual accuracy.
FAI significantly improves demographic factuality while maintaining diversity.
Diversity interventions cause a factuality trade-off in T2I generation.
Abstract
Prompt-based "diversity interventions" are commonly adopted to improve the diversity of Text-to-Image (T2I) models depicting individuals with various racial or gender traits. However, will this strategy result in nonfactual demographic distribution, especially when generating real historical figures. In this work, we propose DemOgraphic FActualIty Representation (DoFaiR), a benchmark to systematically quantify the trade-off between using diversity interventions and preserving demographic factuality in T2I models. DoFaiR consists of 756 meticulously fact-checked test instances to reveal the factuality tax of various diversity prompts through an automated evidence-supported evaluation pipeline. Experiments on DoFaiR unveil that diversity-oriented instructions increase the number of different gender and racial groups in DALLE-3's generations at the cost of historically inaccurate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Law, AI, and Intellectual Property · Software Engineering Research
