Hyper-parameter tuning for text guided image editing

Shiwen Zhang

arXiv:2407.21703·cs.CV·August 1, 2024

Hyper-parameter tuning for text guided image editing

Shiwen Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces Forgedit, a test-time finetuning method for text-guided image editing that efficiently adapts to individual images and effectively solves overfitting issues present in previous state-of-the-art techniques.

Contribution

It presents a hyper-parameter tuning strategy for Forgedit that enhances editing quality and efficiency without increasing workflow complexity.

Findings

01

Forgedit can adapt to images in 30 seconds using fixed hyper-parameters.

02

The method effectively addresses overfitting in text-guided image editing.

03

Hyper-parameter tuning improves the quality of image edits.

Abstract

The test-time finetuning text-guided image editing method, Forgedit, is capable of tackling general and complex image editing problems given only the input image itself and the target text prompt. During finetuning stage, using the same set of finetuning hyper-paramters every time for every given image, Forgedit remembers and understands the input image in 30 seconds. During editing stage, the workflow of Forgedit might seem complicated. However, in fact, the editing process of Forgedit is not more complex than previous SOTA Imagic, yet completely solves the overfitting problem of Imagic. In this paper, we will elaborate the workflow of Forgedit editing stage with examples. We will show how to tune the hyper-parameters in an efficient way to obtain ideal editing results.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

witcherofresearch/forgedit
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques

MethodsSparse Evolutionary Training