FastVMT: Eliminating Redundancy in Video Motion Transfer

Yue Ma; Zhikai Wang; Tianhao Ren; Mingzhe Zheng; Hongyu Liu; Jiayi Guo; Kunyu Feng; Yuxuan Xue; Zixiang Zhao; Konrad Schindler; Qifeng Chen; Linfeng Zhang

arXiv:2602.05551·cs.CV·March 31, 2026

FastVMT: Eliminating Redundancy in Video Motion Transfer

Yue Ma, Zhikai Wang, Tianhao Ren, Mingzhe Zheng, Hongyu Liu, Jiayi Guo, Kunyu Feng, Yuxuan Xue, Zixiang Zhao, Konrad Schindler, Qifeng Chen, Linfeng Zhang

PDF

1 Video

TL;DR

FastVMT enhances video motion transfer by removing computational redundancies, achieving over three times faster processing without sacrificing quality or consistency.

Contribution

The paper introduces novel techniques to eliminate motion and gradient redundancies in DiT-based video transfer, significantly improving speed.

Findings

01

FastVMT achieves a 3.43x speedup on average.

02

It maintains visual fidelity and temporal consistency.

03

Redundancy removal does not degrade output quality.

Abstract

Video motion transfer aims to synthesize videos by generating visual content according to a text prompt while transferring the motion pattern observed in a reference video. Recent methods predominantly use the Diffusion Transformer (DiT) architecture. To achieve satisfactory runtime, several methods attempt to accelerate the computations in the DiT, but fail to address structural sources of inefficiency. In this work, we identify and remove two types of computational redundancy in earlier work: motion redundancy arises because the generic DiT architecture does not reflect the fact that frame-to-frame motion is small and smooth; gradient redundancy occurs if one ignores that gradients change slowly along the diffusion trajectory. To mitigate motion redundancy, we mask the corresponding attention layers to a local neighborhood such that interaction weights are not computed unnecessarily…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

FastVMT: Eliminating Redundancy in Video Motion Transfer· slideslive