Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance
Reyhane Askari Hemmat, Melissa Hall, Alicia Sun, Candace Ross, Michal, Drozdzal, Adriana Romero-Soriano

TL;DR
This paper introduces a new inference-time method called c-VSG that guides diffusion models to produce more geographically diverse images of common objects, aligning better with real-world regional variations.
Contribution
The work presents c-VSG, a novel guidance technique that enhances geographic diversity in generated images while preserving quality, addressing biases in existing models.
Findings
Significantly improves regional diversity in generated images.
Maintains or enhances image quality and consistency.
Reduces regional bias and portrayals in generated images.
Abstract
With the growing popularity of text-to-image generative models, there has been increasing focus on understanding their risks and biases. Recent work has found that state-of-the-art models struggle to depict everyday objects with the true diversity of the real world and have notable gaps between geographic regions. In this work, we aim to increase the diversity of generated images of common objects such that per-region variations are representative of the real world. We introduce an inference time intervention, contextualized Vendi Score Guidance (c-VSG), that guides the backwards steps of latent diffusion models to increase the diversity of a sample as compared to a "memory bank" of previously generated images while constraining the amount of variation within that of an exemplar set of real-world contextualizing images. We evaluate c-VSG with two geographically representative datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Satellite Image Processing and Photogrammetry
MethodsSparse Evolutionary Training · Focus · Diffusion
