Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising

Assaf Singer; Noam Rotstein; Amir Mann; Ron Kimmel; Or Litany

arXiv:2511.08633 · cs.CV · November 13, 2025

TL;DR

Time-to-Move (TTM) is a training-free, plug-and-play framework that enables precise motion and appearance control in diffusion-based video generation using dual-clock denoising and crude reference animations.

Contribution

We introduce TTM, a training-free method for motion- and appearance-controlled video synthesis that leverages coarse animations and dual-clock denoising and is compatible with any image-to-video (I2V) diffusion backbone.

Findings

01. Matches or exceeds existing baselines in realism and motion control.

02. Enables pixel-level appearance control surpassing text-only prompts.

03. No additional training or runtime cost.

Abstract

Diffusion-based video generation can create realistic videos, yet existing image- and text-based conditioning fails to offer precise motion control. Prior methods for motion-conditioned synthesis typically require model-specific fine-tuning, which is computationally expensive and restrictive. We introduce Time-to-Move (TTM), a training-free, plug-and-play framework for motion- and appearance-controlled video generation with image-to-video (I2V) diffusion models. Our key insight is to use crude reference animations obtained through user-friendly manipulations such as cut-and-drag or depth-based reprojection. Motivated by SDEdit's use of coarse layout cues for image editing, we treat the crude animations as coarse motion cues and adapt the mechanism to the video domain. We preserve appearance with image conditioning and introduce dual-clock denoising, a region-dependent strategy that…
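To make the idea of region-dependent noising more concrete, here is a minimal sketch of SDEdit-style forward noising applied with two different timesteps selected by a mask. It is an illustration under stated assumptions, not the authors' implementation: the linear beta schedule, the function names `alpha_bar` and `noise_with_two_clocks`, and the choice to give the masked region the smaller timestep (so it stays closer to the crude reference) are all hypothetical.

```python
import math
import random


def alpha_bar(t, num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative product of (1 - beta_i) for i < t under a linear beta schedule.

    The schedule and its constants are assumptions chosen for illustration.
    """
    prod = 1.0
    for i in range(t):
        beta = beta_start + (beta_end - beta_start) * i / (num_steps - 1)
        prod *= 1.0 - beta
    return prod


def noise_with_two_clocks(frame, mask, t_inside, t_outside, rng):
    """Forward-noise a 2D frame with a per-pixel choice between two timesteps.

    Pixels where mask is truthy are noised to t_inside, all others to t_outside,
    using x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps.
    """
    a_in, a_out = alpha_bar(t_inside), alpha_bar(t_outside)
    noisy = []
    for row, mask_row in zip(frame, mask):
        out_row = []
        for x0, m in zip(row, mask_row):
            a = a_in if m else a_out
            eps = rng.gauss(0.0, 1.0)  # standard Gaussian noise sample
            out_row.append(math.sqrt(a) * x0 + math.sqrt(1.0 - a) * eps)
        noisy.append(out_row)
    return noisy


if __name__ == "__main__":
    rng = random.Random(0)
    frame = [[0.5, -0.5], [1.0, 0.0]]  # one tiny frame of a crude animation
    mask = [[1, 0], [0, 1]]            # 1 marks the user-controlled region
    print(noise_with_two_clocks(frame, mask, t_inside=300, t_outside=900, rng=rng))
```

In this sketch a smaller `t_inside` leaves more of the reference signal in the masked region, while a larger `t_outside` leaves the rest of the frame closer to pure noise, so a denoiser would have more freedom there.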

Code & Models

Models

🤗 rmz92002/time-to-move (model, ♡ 6)

Taxonomy

Topics: Generative Adversarial Networks and Image Synthesis · Human Motion and Animation · 3D Shape Modeling and Analysis