FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance

Quanhao Li; Zhen Xing; Rui Wang; Haidong Cao; Qi Dai; Daoguo Dong; Zuxuan Wu

arXiv:2603.12146·cs.CV·March 13, 2026

FlashMotion: Few-Step Controllable Video Generation with Trajectory Guidance

Quanhao Li, Zhen Xing, Rui Wang, Haidong Cao, Qi Dai, Daoguo Dong, Zuxuan Wu

PDF

Open Access 1 Models 1 Datasets

TL;DR

FlashMotion introduces a training framework that enables few-step, trajectory-controllable video generation with high quality and accuracy, significantly reducing computational overhead compared to multi-step methods.

Contribution

The paper proposes a novel training framework combining trajectory adapter training, generator distillation, and hybrid finetuning to improve efficiency and quality in trajectory-controllable video generation.

Findings

01

Outperforms existing methods in visual quality and trajectory accuracy.

02

Reduces generation steps, improving efficiency.

03

Introduces FlashBench benchmark for evaluation.

Abstract

Recent advances in trajectory-controllable video generation have achieved remarkable progress. Previous methods mainly use adapter-based architectures for precise motion control along predefined trajectories. However, all these methods rely on a multi-step denoising process, leading to substantial time redundancy and computational overhead. While existing video distillation methods successfully distill multi-step generators into few-step, directly applying these approaches to trajectory-controllable video generation results in noticeable degradation in both video quality and trajectory accuracy. To bridge this gap, we introduce FlashMotion, a novel training framework designed for few-step trajectory-controllable video generation. We first train a trajectory adapter on a multi-step video generator for precise trajectory control. Then, we distill the generator into a few-step version to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
quanhaol/FlashMotion
model· ♡ 3
♡ 3

Datasets

quanhaol/FlashBench
dataset· 17 dl
17 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Human Motion and Animation · Generative Adversarial Networks and Image Synthesis