Exploring Iterative Manifold Constraint for Zero-shot Image Editing
Maomao Li, Yu Li, Yunfei Liu, and Dong Xu

TL;DR
This paper introduces ZZEdit, a zero-shot image editing method that uses an intermediate-inverted latent and iterative manifold constraints to improve editability and fidelity over traditional inversion-then-editing approaches.
Contribution
The paper proposes a novel zero-shot editing paradigm with an intermediate-inverted latent and a ZigZag denoising process, reducing fidelity errors and enhancing editing quality.
Findings
ZZEdit outperforms existing methods in editability and fidelity.
Iterative manifold constraint reduces fidelity errors.
Intermediate-inverted latents improve trade-off between editability and fidelity.
Abstract
Editability and fidelity are two essential demands for text-driven image editing, which expects that the editing area should align with the target prompt and the rest remain unchanged separately. The current cutting-edge editing methods usually obey an "inversion-then-editing" pipeline, where the input image is inverted to an approximate Gaussian noise , based on which a sampling process is conducted using the target prompt. Nevertheless, we argue that it is not a good choice to use a near-Gaussian noise as a pivot for further editing since it would bring plentiful fidelity errors. We verify this by a pilot analysis, discovering that intermediate-inverted latents can achieve a better trade-off between editability and fidelity than the fully-inverted . Based on this, we propose a novel zero-shot editing paradigm dubbed ZZEdit, which first locates a qualified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
MethodsALIGN
