Spatiotemporal Skip Guidance for Enhanced Video Diffusion Sampling
Junha Hyung, Kinam Kim, Susung Hong, Min-Jung Kim, Jaegul Choo

TL;DR
This paper introduces Spatiotemporal Skip Guidance (STG), a training-free method that improves video diffusion sampling quality by simulating a weak model through layer skipping, without sacrificing diversity or motion.
Contribution
STG is a novel, training-free guidance technique that enhances transformer-based video diffusion models by selectively skipping layers to simulate a weak model.
Findings
STG improves sample quality in video diffusion models.
STG does not reduce diversity or motion in generated videos.
STG eliminates the need for auxiliary models or additional training.
Abstract
Diffusion models have emerged as a powerful tool for generating high-quality images, videos, and 3D content. While sampling guidance techniques like CFG improve quality, they reduce diversity and motion. Autoguidance mitigates these issues but demands extra weak model training, limiting its practicality for large-scale models. In this work, we introduce Spatiotemporal Skip Guidance (STG), a simple training-free sampling guidance method for enhancing transformer-based video diffusion models. STG employs an implicit weak model via self-perturbation, avoiding the need for external models or additional training. By selectively skipping spatiotemporal layers, STG produces an aligned, degraded version of the original model to boost sample quality without compromising diversity or dynamic degree. Our contributions include: (1) introducing STG as an efficient, high-performing guidance technique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhotoacoustic and Ultrasonic Imaging · Medical Imaging Techniques and Applications · Image and Signal Denoising Methods
MethodsDiffusion
