Tuning-free Instruction-based Video Editing Via Structural Noise Initialization and Guidance

Song Wu; Xinyu Chen; Qian Wang; Liang Li; Zili Yi; and Junlan Feng

arXiv:2605.15533·cs.CV·May 18, 2026

Tuning-free Instruction-based Video Editing Via Structural Noise Initialization and Guidance

Song Wu, Xinyu Chen, Qian Wang, Liang Li, Zili Yi, and Junlan Feng

PDF

TL;DR

This paper introduces a tuning-free, instruction-based video editing framework that utilizes a novel Structural Noise Initialization Strategy and Noise Guidance Mechanism to improve editing quality and content preservation.

Contribution

It presents a new approach that leverages noisy latent representations for tuning-free video editing, enhancing content change and consistency without extensive training.

Findings

01

Achieves better visual quality than previous methods.

02

State-of-the-art performance in video editing tasks.

03

Effectively preserves unedited content during editing.

Abstract

Video editing poses a significant challenge. While a series of tuning-free methods circumvent the need for extensive data collection and model training, they often underutilize the rich information embedded within noisy latent, leading to unsatisfactory results. To address this, we propose a \textit{tuning-free, instruction-based} video editing framework. We approach video editing from the perspective of noisy latent: we design a Structural Noise Initialization Strategy (SNIS) to secure a superior editing starting point by assigning higher noise levels to edited regions (to facilitate content change) and lower noise levels to unedited regions (to maintain content consistency). We introduce a Noise Guidance Mechanism (NGM), which leverages the video prior in the generative model and effectively integrates rich information within the noisy latent to guide the denoising process, thereby…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.