Diverse Diffusion: Enhancing Image Diversity in Text-to-Image Generation
Mariia Zameshina (LIGM), Olivier Teytaud (TAU), Laurent Najman (LIGM)

TL;DR
Diverse Diffusion is a novel unsupervised method that enhances image diversity in text-to-image generation, addressing limitations in gender, ethnicity, and color variety to produce more inclusive and varied AI-generated images.
Contribution
We introduce Diverse Diffusion, a general technique that increases diversity in generated images by finding distant vectors in the latent space of existing models.
Findings
Improved color diversity in generated images
Enhanced ethnicity and gender representation
Higher LPIPS scores indicating greater diversity
Abstract
Latent diffusion models excel at producing high-quality images from text. Yet, concerns appear about the lack of diversity in the generated imagery. To tackle this, we introduce Diverse Diffusion, a method for boosting image diversity beyond gender and ethnicity, spanning into richer realms, including color diversity.Diverse Diffusion is a general unsupervised technique that can be applied to existing text-to-image models. Our approach focuses on finding vectors in the Stable Diffusion latent space that are distant from each other. We generate multiple vectors in the latent space until we find a set of vectors that meets the desired distance requirements and the required batch size.To evaluate the effectiveness of our diversity methods, we conduct experiments examining various characteristics, including color diversity, LPIPS metric, and ethnicity/gender representation in images…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis
MethodsSparse Evolutionary Training · Diffusion
