DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing
Chenxi Xie, Minghan Li, Shuai Li, Yuhui Wu, Qiaosi Yi, Lei Zhang

TL;DR
DNAEdit introduces a direct noise alignment technique that refines Gaussian noise in the noise domain, significantly reducing error accumulation in text-guided image editing with diffusion models.
Contribution
The paper proposes DNA, a novel method for noise refinement in rectified flow-based image editing, and introduces MVG for better control, along with a new benchmark for evaluation.
Findings
DNA achieves superior editing accuracy over state-of-the-art methods.
The method reduces noise estimation errors and improves image fidelity.
DNAEdit demonstrates effective prompt-guided image editing with balanced control.
Abstract
Leveraging the powerful generation capability of large-scale pretrained text-to-image models, training-free methods have demonstrated impressive image editing results. Conventional diffusion-based methods, as well as recent rectified flow (RF)-based methods, typically reverse synthesis trajectories by gradually adding noise to clean images, during which the noisy latent at the current timestep is used to approximate that at the next timesteps, introducing accumulated drift and degrading reconstruction accuracy. Considering the fact that in RF the noisy latent is estimated through direct interpolation between Gaussian noises and clean images at each timestep, we propose Direct Noise Alignment (DNA), which directly refines the desired Gaussian noise in the noise domain, significantly reducing the error accumulation in previous methods. Specifically, DNA estimates the velocity field of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMusic and Audio Processing
