Loading paper
VSA: Faster Video Diffusion with Trainable Sparse Attention | Tomesphere