MotionCraft: Physics-based Zero-Shot Video Generation

Luca Savant Aira; Antonio Montanaro; Emanuele Aiello; Diego Valsesia,; Enrico Magli

arXiv:2405.13557·cs.LG·October 28, 2024

MotionCraft: Physics-based Zero-Shot Video Generation

Luca Savant Aira, Antonio Montanaro, Emanuele Aiello, Diego Valsesia,, Enrico Magli

PDF

Open Access 1 Repo 1 Video

TL;DR

MotionCraft introduces a zero-shot video generation method that leverages physics-based optical flow to produce realistic, coherent videos with complex motion, outperforming existing models without extensive training.

Contribution

It warps the noise latent space of diffusion models using physics-based optical flow, enabling realistic zero-shot video synthesis with complex motion dynamics.

Findings

01

Outperforms Text2Video-Zero in qualitative and quantitative metrics

02

Generates physically plausible and coherent videos with complex motion

03

Requires no additional training or fine-tuning

Abstract

Generating videos with realistic and physically plausible motion is one of the main recent challenges in computer vision. While diffusion models are achieving compelling results in image generation, video diffusion models are limited by heavy training and huge models, resulting in videos that are still biased to the training dataset. In this work we propose MotionCraft, a new zero-shot video generator to craft physics-based and realistic videos. MotionCraft is able to warp the noise latent space of an image diffusion model, such as Stable Diffusion, by applying an optical flow derived from a physics simulation. We show that warping the noise latent space results in coherent application of the desired motion while allowing the model to generate missing elements consistent with the scene evolution, which would otherwise result in artefacts or missing content if the flow was applied in the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mezzelfo/MotionCraft
pytorchOfficial

Videos

MotionCraft: Physics-Based Zero-Shot Video Generation· slideslive

Taxonomy

TopicsCinema and Media Studies · Image and Video Quality Assessment · Video Analysis and Summarization

MethodsDiffusion