DynamicRad: Content-Adaptive Sparse Attention for Long Video Diffusion

Yongji Long; Shijun Liang; Jintao Li; Yun Li

arXiv:2604.20470·cs.CV·April 23, 2026

DynamicRad: Content-Adaptive Sparse Attention for Long Video Diffusion

Yongji Long, Shijun Liang, Jintao Li, Yun Li

PDF

1 Repo

TL;DR

DynamicRad introduces a content-adaptive sparse attention method for long video diffusion, significantly improving efficiency while maintaining high quality through a dual-mode strategy and offline optimization.

Contribution

It presents a novel adaptive sparse attention paradigm with a semantic motion router and offline Bayesian optimization, enhancing long video diffusion performance.

Findings

01

Achieves 1.7×–2.5× inference speedups over dense models.

02

Over 80% effective sparsity in attention mechanisms.

03

Matches or exceeds dense baseline quality in long-sequence settings.

Abstract

Leveraging the natural spatiotemporal energy decay in video diffusion offers a path to efficiency, yet relying solely on rigid static masks risks losing critical long-range information in complex dynamics. To address this issue, we propose \textbf{DynamicRad}, a unified sparse-attention paradigm that grounds adaptive selection within a radial locality prior. DynamicRad introduces a \textbf{dual-mode} strategy: \textit{static-ratio} for speed-optimized execution and \textit{dynamic-threshold} for quality-first filtering. To ensure robustness without online search overhead, we integrate an offline Bayesian Optimization (BO) pipeline coupled with a \textbf{semantic motion router}. This lightweight projection module maps prompt embeddings to optimal sparsity regimes with \textbf{minimal runtime overhead}. Unlike online profiling methods, our offline BO optimizes attention reconstruction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Adamlong3/DynamicRad
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.