Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models

Eunseo Koh; Seunghoo Hong; Tae-Young Kim; Simon S. Woo; Jae-Pil Heo

arXiv:2508.10407·cs.CV·August 19, 2025

Translation of Text Embedding via Delta Vector to Suppress Strongly Entangled Content in Text-to-Image Diffusion Models

Eunseo Koh, Seunghoo Hong, Tae-Young Kim, Simon S. Woo, Jae-Pil Heo

PDF

TL;DR

This paper introduces a delta vector technique to modify text embeddings, effectively suppressing strongly entangled content in text-to-image diffusion models, improving image generation quality and control.

Contribution

The paper presents a novel delta vector method for suppressing entangled content in diffusion models, including a zero-shot approach and a region-specific suppression mechanism called SSDV.

Findings

01

Significantly reduces undesired content in generated images

02

Outperforms existing suppression methods quantitatively and qualitatively

03

Enables precise suppression in personalized text-to-image models

Abstract

Text-to-Image (T2I) diffusion models have made significant progress in generating diverse high-quality images from textual prompts. However, these models still face challenges in suppressing content that is strongly entangled with specific words. For example, when generating an image of "Charlie Chaplin", a "mustache" consistently appears even if explicitly instructed not to include it, as the concept of "mustache" is strongly entangled with "Charlie Chaplin". To address this issue, we propose a novel approach to directly suppress such entangled content within the text embedding space of diffusion models. Our method introduces a delta vector that modifies the text embedding to weaken the influence of undesired content in the generated image, and we further demonstrate that this delta vector can be easily obtained through a zero-shot approach. Furthermore, we propose a Selective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.