AdaEraser: Training-Free Object Removal via Adaptive Attention Suppression
Dingming Liu

TL;DR
AdaEraser is a training-free, adaptive attention suppression method that dynamically removes objects from images by improving the inpainting process through token-wise attention modulation.
Contribution
It introduces a novel adaptive framework that modulates self-attention during image denoising, enhancing object removal without training.
Findings
Outperforms existing training-free object removal methods.
Enables progressive object removal through adaptive attention suppression.
Achieves superior results compared to training-based methods.
Abstract
Object removal aims to eliminate specified objects from images while plausibly inpainting the affected regions with background content. Current training-free methods typically block attention to object regions within self-attention layers during the image generation process, leveraging surrounding background information to restore the image. However, indiscriminate suppression of self-attention in the vacated areas can degrade generation quality, as the model must simultaneously reconstruct background content in these regions. To solve this conflict, we propose AdaEraser, an adaptive framework that dynamically modulates attention based on the estimated presence of target object concepts. Through analysis of self-attention map evolution across denoising timesteps before and during removal, we develop a token-wise adaptive attention suppression strategy. This approach enables progressive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
