Loading paper
Mimir: Improving Video Diffusion Models for Precise Text Understanding | Tomesphere