PoseAnything: Universal Pose-guided Video Generation with Part-aware Temporal Coherence
Ruiyan Wang, Teng Hu, Kaihui Huang, Zihan Su, Ran Yi, Lizhuang Ma

TL;DR
PoseAnything is a universal framework for pose-guided video generation that handles both human and non-human subjects, introducing part-aware coherence and independent camera control, supported by a new large-scale dataset.
Contribution
It is the first universal pose-guided video generation method supporting arbitrary skeletal inputs and independent camera motion control.
Findings
Outperforms state-of-the-art methods in effectiveness.
Demonstrates strong generalization to non-human subjects.
Provides a new dataset with 50,000 non-human pose-video pairs.
Abstract
Pose-guided video generation refers to controlling the motion of subjects in generated video through a sequence of poses. It enables precise control over subject motion and has important applications in animation. However, current pose-guided video generation methods are limited to accepting only human poses as input, thus generalizing poorly to pose of other subjects. To address this issue, we propose PoseAnything, the first universal pose-guided video generation framework capable of handling both human and non-human characters, supporting arbitrary skeletal inputs. To enhance consistency preservation during motion, we introduce Part-aware Temporal Coherence Module, which divides the subject into different parts, establishes part correspondences, and computes cross-attention between corresponding parts across frames to achieve fine-grained part-level consistency. Additionally, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis
