Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance
Wenhao Sun, Benlei Cui, Xue-Mei Dong, Jingqun Tang

TL;DR
Attentive Eraser is a tuning-free method that enhances pre-trained diffusion models for stable object removal by redirecting self-attention, resulting in more coherent and artifact-free image editing.
Contribution
The paper introduces ASS and SARG, novel techniques to re-engineer self-attention in diffusion models for improved object removal without additional training.
Findings
Outperforms training-based object removal methods.
Works across various diffusion model architectures.
Ensures stable and coherent object removal results.
Abstract
Recently, diffusion models have emerged as promising newcomers in the field of generative models, shining brightly in image generation. However, when employed for object removal tasks, they still encounter issues such as generating random artifacts and the incapacity to repaint foreground object areas with appropriate content after removal. To tackle these problems, we propose Attentive Eraser, a tuning-free method to empower pre-trained diffusion models for stable and effective object removal. Firstly, in light of the observation that the self-attention maps influence the structure and shape details of the generated images, we propose Attention Activation and Suppression (ASS), which re-engineers the self-attention mechanism within the pre-trained diffusion models based on the given mask, thereby prioritizing the background over the foreground object during the reverse generation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsSoftmax · Attention Is All You Need · Diffusion
