TL;DR
MotionCache introduces a motion-aware caching strategy for autoregressive video generation, significantly speeding up process while maintaining quality by dynamically adjusting denoising steps based on pixel motion.
Contribution
It formalizes the link between cache errors and residual instability and proposes a lightweight, motion-aware cache framework with a coarse-to-fine strategy for improved efficiency.
Findings
Achieves 6.28x speedup on SkyReels-V2
Achieves 1.64x speedup on MAGI-1
Preserves generation quality with minimal degradation
Abstract
Autoregressive video generation paradigms offer theoretical promise for long video synthesis, yet their practical deployment is hindered by the computational burden of sequential iterative denoising. While cache reuse strategies can accelerate generation by skipping redundant denoising steps, existing methods rely on coarse-grained chunk-level skipping that fails to capture fine-grained pixel dynamics. This oversight is critical: pixels with high motion require more denoising steps to prevent error accumulation, while static pixels tolerate aggressive skipping. We formalize this insight theoretically by linking cache errors to residual instability, and propose MotionCache, a motion-aware cache framework that exploits inter-frame differences as a lightweight proxy for pixel-level motion characteristics. MotionCache employs a coarse-to-fine strategy: an initial warm-up phase establishes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
