MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Yuang Zhang; Jiaxi Gu; Li-Wen Wang; Han Wang; Junqi Cheng; Yuefeng Zhu; Fangyuan Zou

arXiv:2406.19680·cs.CV·June 30, 2025·2 cites

MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Yuang Zhang, Jiaxi Gu, Li-Wen Wang, Han Wang, Junqi Cheng, Yuefeng Zhu, Fangyuan Zou

PDF

Open Access 1 Repo 1 Video

TL;DR

MimicMotion is a controllable video generation framework that produces high-quality, long, and smooth human motion videos by leveraging confidence-aware pose guidance and a progressive latent fusion strategy.

Contribution

The paper introduces confidence-aware pose guidance, regional loss amplification, and a progressive latent fusion strategy for high-quality, controllable, and long human motion video generation.

Findings

01

Significant quality improvements over previous methods.

02

Effective generation of arbitrarily long videos.

03

Enhanced temporal smoothness and detail richness.

Abstract

In recent years, generative artificial intelligence has achieved significant advancements in the field of image generation, spawning a variety of applications. However, video generation still faces considerable challenges in various aspects, such as controllability, video length, and richness of details, which hinder the application and popularization of this technology. In this work, we propose a controllable video generation framework, dubbed MimicMotion, which can generate high-quality videos of arbitrary length mimicking specific motion guidance. Compared with previous methods, our approach has several highlights. Firstly, we introduce confidence-aware pose guidance that ensures high frame quality and temporal smoothness. Secondly, we introduce regional loss amplification based on pose confidence, which significantly reduces image distortion. Lastly, for generating long and smooth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vita-epfl/stay-on-track
pytorch

Videos

MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance· slideslive

Taxonomy

TopicsAdvanced Vision and Imaging · Human Motion and Animation · Human Pose and Action Recognition