RegionDrag: Fast Region-Based Image Editing with Diffusion Models
Jingyi Lu, Xinghui Li, Kai Han

TL;DR
RegionDrag introduces a fast, region-based image editing method using diffusion models, significantly improving speed and accuracy over point-drag techniques by allowing users to specify handle and target regions for precise control.
Contribution
The paper presents a novel region-based editing approach that overcomes the computational and interpretative limitations of point-drag methods, enabling faster and more accurate image edits.
Findings
RegionDrag completes 512x512 image edits in under 2 seconds.
It outperforms DragDiffusion in speed by over 100 times.
RegionDrag achieves better alignment with user intentions.
Abstract
Point-drag-based image editing methods, like DragDiffusion, have attracted significant attention. However, point-drag-based approaches suffer from computational overhead and misinterpretation of user intentions due to the sparsity of point-based editing instructions. In this paper, we propose a region-based copy-and-paste dragging method, RegionDrag, to overcome these limitations. RegionDrag allows users to express their editing instructions in the form of handle and target regions, enabling more precise control and alleviating ambiguity. In addition, region-based operations complete editing in one iteration and are much faster than point-drag-based methods. We also incorporate the attention-swapping technique for enhanced stability during editing. To validate our approach, we extend existing point-drag-based datasets with region-based dragging instructions. Experimental results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques
