Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI

Carolina Lopez Olmos; Alexandros Neophytou; Sunando Sengupta; Dim P.; Papadopoulos

arXiv:2406.06352·cs.CV·June 11, 2024·2 cites

Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI

Carolina Lopez Olmos, Alexandros Neophytou, Sunando Sengupta, Dim P., Papadopoulos

PDF

Open Access

TL;DR

This paper presents a simple, effective method for bias mitigation in generative AI by learning latent directions in the diffusion process, enabling diverse and inclusive image synthesis without altering prompts or embeddings.

Contribution

Introduces a novel latent space direction learning approach for bias mitigation in generative models, allowing linear combination and integration with text embedding adjustments.

Findings

01

Successfully mitigates geographical biases in generated images

02

Enables linear combination of latent directions for new bias mitigations

03

Provides a tool for developers to select concepts for bias mitigation

Abstract

Mitigating biases in generative AI and, particularly in text-to-image models, is of high importance given their growing implications in society. The biased datasets used for training pose challenges in ensuring the responsible development of these models, and mitigation through hard prompting or embedding alteration, are the most common present solutions. Our work introduces a novel approach to achieve diverse and inclusive synthetic images by learning a direction in the latent space and solely modifying the initial Gaussian noise provided for the diffusion process. Maintaining a neutral prompt and untouched embeddings, this approach successfully adapts to diverse debiasing scenarios, such as geographical biases. Moreover, our work proves it is possible to linearly combine these learned latent directions to introduce new mitigations, and if desired, integrate it with text embedding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI

MethodsDiffusion