MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation

Yanchen Liu; Yanan Sun; Zhening Xing; Junyao Gao; Kai Chen; Wenjie Pei

arXiv:2507.16310·cs.CV·July 23, 2025

Open Access

TL;DR

MotionShot is a training-free framework that enables high-fidelity, coherent motion transfer across arbitrary objects in text-to-video generation by combining semantic matching and shape retargeting.

Contribution

It introduces a novel, training-free method for fine-grained motion transfer that handles significant appearance and structure differences.

Findings

01 Effective motion transfer across diverse objects

02 Preserves appearance coherence during transfer

03 Demonstrates superior performance in experiments

Abstract

Existing text-to-video methods struggle to transfer motion smoothly from a reference object to a target object when the two differ significantly in appearance or structure. To address this challenge, we introduce MotionShot, a training-free framework that parses reference-target correspondences in a fine-grained manner, achieving high-fidelity motion transfer while preserving appearance coherence. Specifically, MotionShot first performs semantic feature matching to establish high-level alignment between the reference and target objects. It then establishes low-level morphological alignment through reference-to-target shape retargeting. By encoding motion with temporal attention, MotionShot coherently transfers motion across objects even under significant appearance and structure disparities, as demonstrated by extensive experiments.…
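
To make the semantic feature matching step more concrete, here is a minimal sketch of nearest-neighbor matching by cosine similarity between reference and target feature vectors. It illustrates the general idea only and is not the authors' implementation: the function names `cosine` and `match_features`, the toy 2-D feature vectors, and the use of cosine similarity are all assumptions made for this example. The abstract does not specify which feature extractor or similarity measure MotionShot uses.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def match_features(ref_feats, tgt_feats):
    """For each reference feature, return the index of the most similar target feature.

    ref_feats and tgt_feats are hypothetical lists of descriptors, one per
    keypoint or image location; a real system would take them from a
    pretrained vision backbone.
    """
    matches = []
    for r in ref_feats:
        sims = [cosine(r, t) for t in tgt_feats]
        best = max(range(len(sims)), key=sims.__getitem__)
        matches.append(best)
    return matches

# Toy example: two reference descriptors, three target descriptors.
ref_feats = [[1.0, 0.0], [0.0, 2.0]]
tgt_feats = [[0.0, 1.0], [3.0, 0.1], [1.0, 1.0]]
matches = match_features(ref_feats, tgt_feats)
print(matches)  # [1, 0]
```

In this toy case the first reference descriptor pairs with the second target descriptor and the second reference descriptor with the first. Such high-level correspondences would then serve as anchors for a subsequent shape retargeting step, which this sketch does not attempt to model.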

Taxonomy

Topics: Human Motion and Animation · Multimedia Communication and Technology · Video Analysis and Summarization