On the Fairness, Diversity and Reliability of Text-to-Image Generative Models
Jordan Vice, Naveed Akhtar, Leonid Sigal, Richard Hartley, Ajmal Mian

TL;DR
This paper introduces an evaluation framework for assessing the reliability, fairness, and diversity of text-to-image generative models, addressing their vulnerabilities and social implications.
Contribution
It proposes novel methods to analyze model responses to perturbations, evaluate diversity and fairness, and trace embedded biases in text-to-image models.
Findings
Framework effectively identifies unreliable and biased behaviors.
Evaluation of diversity reveals the range of visual concepts generated.
Fairness analysis shows impact of concept removal on control.
Abstract
The rapid proliferation of multimodal generative models has sparked critical discussions on their reliability, fairness and potential for misuse. While text-to-image models excel at producing high-fidelity, user-guided content, they often exhibit unpredictable behaviors and vulnerabilities that can be exploited to manipulate class or concept representations. To address this, we propose an evaluation framework to assess model reliability by analyzing responses to global and local perturbations in the embedding space, enabling the identification of inputs that trigger unreliable or biased behavior. Beyond social implications, fairness and diversity are fundamental to defining robust and trustworthy model behavior. Our approach offers deeper insights into these essential aspects by evaluating: (i) generative diversity, measuring the breadth of visual representations for learned concepts,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods
