Reproducing DragDiffusion: Interactive Point-Based Editing with Diffusion Models
Ali Subhan, Ashir Raza

TL;DR
This paper conducts a reproducibility study of DragDiffusion, a diffusion-based interactive image editing method, confirming its core claims while analyzing sensitivity to hyperparameters and evaluating a multi-timestep variant.
Contribution
It provides a detailed reproducibility analysis of DragDiffusion, clarifying the conditions for reliable results and assessing the impact of hyperparameters and modifications.
Findings
Reproduced main ablation studies with close agreement.
Performance is sensitive to certain hyperparameters like timestep and feature level.
Multi-timestep optimization does not improve accuracy but increases computational cost.
Abstract
DragDiffusion is a diffusion-based method for interactive point-based image editing that enables users to manipulate images by directly dragging selected points. The method claims that accurate spatial control can be achieved by optimizing a single diffusion latent at an intermediate timestep, together with identity-preserving fine-tuning and spatial regularization. This work presents a reproducibility study of DragDiffusion using the authors' released implementation and the DragBench benchmark. We reproduce the main ablation studies on diffusion timestep selection, LoRA-based fine-tuning, mask regularization strength, and UNet feature supervision, and observe close agreement with the qualitative and quantitative trends reported in the original work. At the same time, our experiments show that performance is sensitive to a small number of hyperparameter assumptions, particularly the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Advanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis
