Diverse Diffusion: Enhancing Image Diversity in Text-to-Image Generation

Mariia Zameshina (LIGM); Olivier Teytaud (TAU); Laurent Najman (LIGM)

arXiv:2310.12583·cs.CV·October 20, 2023·1 cites

Diverse Diffusion: Enhancing Image Diversity in Text-to-Image Generation

Mariia Zameshina (LIGM), Olivier Teytaud (TAU), Laurent Najman (LIGM)

PDF

Open Access

TL;DR

Diverse Diffusion is a novel unsupervised method that enhances image diversity in text-to-image generation, addressing limitations in gender, ethnicity, and color variety to produce more inclusive and varied AI-generated images.

Contribution

We introduce Diverse Diffusion, a general technique that increases diversity in generated images by finding distant vectors in the latent space of existing models.

Findings

01

Improved color diversity in generated images

02

Enhanced ethnicity and gender representation

03

Higher LPIPS scores indicating greater diversity

Abstract

Latent diffusion models excel at producing high-quality images from text. Yet, concerns appear about the lack of diversity in the generated imagery. To tackle this, we introduce Diverse Diffusion, a method for boosting image diversity beyond gender and ethnicity, spanning into richer realms, including color diversity.Diverse Diffusion is a general unsupervised technique that can be applied to existing text-to-image models. Our approach focuses on finding vectors in the Stable Diffusion latent space that are distant from each other. We generate multiple vectors in the latent space until we find a set of vectors that meets the desired distance requirements and the required batch size.To evaluate the effectiveness of our diversity methods, we conduct experiments examining various characteristics, including color diversity, LPIPS metric, and ethnicity/gender representation in images…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis

MethodsSparse Evolutionary Training · Diffusion