FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing

Gwanhyeong Koo; Sunjae Yoon; Ji Woo Hong; Chang D. Yoo

arXiv:2407.17850 · cs.CV · July 26, 2024

TL;DR

FlexiEdit improves non-rigid image editing by refining the DDIM latent, reducing its high-frequency components in the targeted editing areas so that edits can alter the layout and reflect the input prompt more accurately.

Contribution

The paper introduces FlexiEdit, a method that enhances non-rigid image editing by refining the DDIM latent to improve fidelity to the text prompt and to better accommodate layout adjustments.

Findings

1. Enhanced editing fidelity demonstrated in experiments
2. Better preservation of original image features
3. Improved handling of complex non-rigid edits

Abstract

Current image editing methods primarily rely on DDIM inversion, employing a two-branch diffusion approach to preserve the attributes and layout of the original image. However, these methods struggle with non-rigid edits, which involve altering the image's layout or structure. Our comprehensive analysis reveals that the high-frequency components of the DDIM latent, which are crucial for retaining the original image's key features and layout, contribute significantly to these limitations. To address this, we introduce FlexiEdit, which enhances fidelity to input text prompts by refining the DDIM latent, reducing its high-frequency components in targeted editing areas. FlexiEdit comprises two key components: (1) Latent Refinement, which modifies the DDIM latent to better accommodate layout adjustments, and (2) Edit Fidelity Enhancement via Re-inversion, aimed at ensuring the edits more accurately…
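
The idea of reducing high-frequency components of a latent only inside an editing region can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function name `lowpass_in_mask`, the ideal (hard-cutoff) low-pass filter, the `cutoff` value, and the `(C, H, W)` latent layout are all assumptions made for illustration, since the abstract does not specify the filter FlexiEdit uses.

```python
import numpy as np


def lowpass_in_mask(latent, mask, cutoff=0.25):
    """Attenuate high spatial frequencies of `latent` inside `mask` only.

    Hypothetical helper, not from the paper.
    latent: array of shape (C, H, W), e.g. a DDIM-inverted latent.
    mask:   array of shape (H, W) with values in [0, 1]; 1 marks the edit region.
    cutoff: radial frequency threshold in cycles per sample (assumed value).
    """
    _, h, w = latent.shape

    # Radial frequency of every FFT bin, in cycles per sample.
    fy = np.fft.fftfreq(h)[:, None]
    fx = np.fft.fftfreq(w)[None, :]
    radius = np.sqrt(fy**2 + fx**2)

    # Ideal low-pass filter: keep bins at or below the cutoff, zero the rest.
    keep = (radius <= cutoff).astype(latent.dtype)

    # Filter each channel in the frequency domain.
    spectrum = np.fft.fft2(latent, axes=(-2, -1))
    smoothed = np.fft.ifft2(spectrum * keep, axes=(-2, -1)).real

    # Blend: smoothed latent inside the mask, original latent outside it.
    return mask * smoothed + (1.0 - mask) * latent


# Example: refine only the right half of a random 4-channel latent.
rng = np.random.default_rng(0)
z = rng.standard_normal((4, 64, 64))
edit_mask = np.zeros((64, 64))
edit_mask[:, 32:] = 1.0
z_refined = lowpass_in_mask(z, edit_mask, cutoff=0.2)
```

Blending in the spatial domain leaves the latent outside the mask untouched, which matches the abstract's emphasis on restricting the change to targeted editing areas. A hard mask edge reintroduces some high-frequency content at the boundary, so a soft (feathered) mask may be preferable in practice.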

Code & Models

Repositories

kookie12/FlexiEdit (PyTorch, official)

Taxonomy

Topics: Advanced Data Storage Technologies · Caching and Content Delivery · Topic Modeling

Methods: Diffusion