DragText: Rethinking Text Embedding in Point-based Image Editing

Gayoon Choi; Taejin Jeong; Sujung Hong; Seong Jae Hwang

arXiv:2407.17843·cs.CV·December 5, 2024

DragText: Rethinking Text Embedding in Point-based Image Editing

Gayoon Choi, Taejin Jeong, Sujung Hong, Seong Jae Hwang

PDF

Open Access 1 Repo

TL;DR

This paper introduces DragText, a method that optimizes text embeddings during point-based image editing to improve content consistency and manipulation accuracy in diffusion models.

Contribution

It explores the interaction between text and image embeddings and proposes a novel optimization of text embeddings during editing, enhancing existing diffusion-based methods.

Findings

01

Improved content preservation during image editing.

02

Enhanced manipulation accuracy with optimized text embeddings.

03

Seamless integration with existing diffusion-based drag methods.

Abstract

Point-based image editing enables accurate and flexible control through content dragging. However, the role of text embedding during the editing process has not been thoroughly investigated. A significant aspect that remains unexplored is the interaction between text and image embeddings. During the progressive editing in a diffusion model, the text embedding remains constant. As the image embedding increasingly diverges from its initial state, the discrepancy between the image and text embeddings presents a significant challenge. In this study, we found that the text prompt significantly influences the dragging process, particularly in maintaining content integrity and achieving the desired manipulation. Upon these insights, we propose DragText, which optimizes text embedding in conjunction with the dragging process to pair with the modified image embedding. Simultaneously, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

MICV-yonsei/DragText
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques

MethodsDiffusion