MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning
Yuying Ge, Yibing Song, Ruimao Zhang, Ping Luo

TL;DR
MetaDance introduces a meta-learning approach that models temporal dynamics in few-shot dancing video retargeting, enabling high-quality, temporally coherent videos for unseen persons with minimal data.
Contribution
The paper proposes a novel temporal-aware meta-learning framework that improves few-shot dancing video retargeting by capturing temporal correlations, surpassing previous methods that treat frames independently.
Findings
Significantly improves visual quality of generated videos.
Enhances temporal stability in retargeted dancing videos.
Requires only a few frames for effective retargeting.
Abstract
Dancing video retargeting aims to synthesize a video that transfers the dance movements from a source video to a target person. Previous work need collect a several-minute-long video of a target person with thousands of frames to train a personalized model. However, the trained model can only generate videos of the same person. To address the limitations, recent work tackled few-shot dancing video retargeting, which learns to synthesize videos of unseen persons by leveraging a few frames of them. In practice, given a few frames of a person, these work simply regarded them as a batch of individual images without temporal correlations, thus generating temporally incoherent dancing videos of low visual quality. In this work, we model a few frames of a person as a series of dancing moves, where each move contains two consecutive frames, to extract the appearance patterns and the temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Generative Adversarial Networks and Image Synthesis
