Don't Forget your Inverse DDIM for Image Editing
Guillermo Gomez-Trenado, Pablo Mesejo, Oscar Cord\'on, St\'ephane Lathuili\`ere

TL;DR
This paper introduces SAGE, a novel image editing method leveraging pre-trained diffusion models and self-attention guidance to improve reconstruction quality and efficiency in real image editing tasks.
Contribution
SAGE utilizes attention maps during inverse DDIM to enable efficient, high-quality image editing without full image reconstruction, advancing diffusion-based editing techniques.
Findings
SAGE outperforms competing methods in 7 of 10 quantitative metrics.
All 47 users preferred SAGE in a user study.
SAGE achieves top rankings in multiple evaluations.
Abstract
The field of text-to-image generation has undergone significant advancements with the introduction of diffusion models. Nevertheless, the challenge of editing real images persists, as most methods are either computationally intensive or produce poor reconstructions. This paper introduces SAGE (Self-Attention Guidance for image Editing) - a novel technique leveraging pre-trained diffusion models for image editing. SAGE builds upon the DDIM algorithm and incorporates a novel guidance mechanism utilizing the self-attention layers of the diffusion U-Net. This mechanism computes a reconstruction objective based on attention maps generated during the inverse DDIM process, enabling efficient reconstruction of unedited regions without the need to precisely reconstruct the entire input image. Thus, SAGE directly addresses the key challenges in image editing. The superiority of SAGE over other…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Softmax · Attention Is All You Need · Convolution · Max Pooling · Concatenated Skip Connection · Diffusion · U-Net
