Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising
Assaf Singer, Noam Rotstein, Amir Mann, Ron Kimmel, Or Litany

TL;DR
Time-to-Move (TTM) is a training-free, plug-and-play framework that enables precise motion and appearance control in diffusion-based video generation using dual-clock denoising and crude reference animations.
Contribution
We introduce TTM, a training-free method for motion- and appearance-controlled video synthesis that combines coarse reference animations with dual-clock denoising and plugs into existing image-to-video (I2V) diffusion backbones.
Findings
Matches or exceeds existing baselines in realism and motion control.
Enables pixel-level appearance control surpassing text-only prompts.
No additional training or runtime cost.
Abstract
Diffusion-based video generation can create realistic videos, yet existing image- and text-based conditioning fails to offer precise motion control. Prior methods for motion-conditioned synthesis typically require model-specific fine-tuning, which is computationally expensive and restrictive. We introduce Time-to-Move (TTM), a training-free, plug-and-play framework for motion- and appearance-controlled video generation with image-to-video (I2V) diffusion models. Our key insight is to use crude reference animations obtained through user-friendly manipulations such as cut-and-drag or depth-based reprojection. Motivated by SDEdit's use of coarse layout cues for image editing, we treat the crude animations as coarse motion cues and adapt the mechanism to the video domain. We preserve appearance with image conditioning and introduce dual-clock denoising, a region-dependent strategy that…
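The abstract is cut off before it describes the mechanism, so the following is only a toy illustration of what a region-dependent, SDEdit-style starting point could look like, not the authors' implementation. It assumes that pixels inside a user-specified motion mask are noised lightly (so they stay close to the crude animation) and that the remaining pixels are noised heavily (so the model is free to re-synthesize them). The linear noise schedule, the function names, and the parameters `t_controlled` and `t_free` are all hypothetical.

```python
import numpy as np

def noise_to_level(x0, t, rng):
    """Noise a clean signal x0 to level t in [0, 1].

    Toy variance-preserving schedule (an assumption, not the paper's):
    t = 0 returns x0 unchanged, t = 1 returns pure Gaussian noise.
    t may be a scalar or an array broadcastable to x0's shape.
    """
    return np.sqrt(1.0 - t) * x0 + np.sqrt(t) * rng.standard_normal(x0.shape)

def region_dependent_start(reference, mask, t_controlled, t_free, rng):
    """Build a noisy starting point with a per-pixel noise level.

    Pixels with mask > 0 get the lower level t_controlled; all other
    pixels get the higher level t_free. Returns the noisy video and the
    per-pixel noise-level map.
    """
    t_map = np.where(mask > 0, t_controlled, t_free)
    return noise_to_level(reference, t_map, rng), t_map

rng = np.random.default_rng(0)
reference = rng.random((4, 8, 8))   # hypothetical crude animation: frames x height x width
mask = np.zeros_like(reference)
mask[:, 2:6, 2:6] = 1.0             # hypothetical region where the user specified motion
noisy, t_map = region_dependent_start(
    reference, mask, t_controlled=0.4, t_free=0.9, rng=rng
)
```

In a real pipeline the noisy tensor would then be handed to an I2V diffusion model for denoising; that step is omitted here because the excerpt does not specify how the two noise levels are reconciled during sampling.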
Taxonomy
Topics: Generative Adversarial Networks and Image Synthesis · Human Motion and Animation · 3D Shape Modeling and Analysis
