DiffUHaul: A Training-Free Method for Object Dragging in Images
Omri Avrahami, Rinon Gal, Gal Chechik, Ohad Fried, Dani Lischinski,, Arash Vahdat, Weili Nie

TL;DR
DiffUHaul is a training-free, diffusion-based method that enables seamless object dragging in images by leveraging spatial reasoning and attention mechanisms, improving real-world image editing without retraining models.
Contribution
The paper introduces DiffUHaul, a novel training-free approach that uses attention masking, diffusion anchoring, and self-attention bucketing for effective object dragging in images.
Findings
Achieves reliable object dragging in real images.
Outperforms existing methods in editing quality.
Validated through user preference studies.
Abstract
Text-to-image diffusion models have proven effective for solving many image editing tasks. However, the seemingly straightforward task of seamlessly relocating objects within a scene remains surprisingly challenging. Existing methods addressing this problem often struggle to function reliably in real-world scenarios due to lacking spatial reasoning. In this work, we propose a training-free method, dubbed DiffUHaul, that harnesses the spatial understanding of a localized text-to-image model, for the object dragging task. Blindly manipulating layout inputs of the localized model tends to cause low editing performance due to the intrinsic entanglement of object representation in the model. To this end, we first apply attention masking in each denoising step to make the generation more disentangled across different objects and adopt the self-attention sharing mechanism to preserve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Digital Imaging for Blood Diseases · AI in cancer detection
MethodsDiffusion
