Zero-shot High-fidelity and Pose-controllable Character Animation

Bingwen Zhu; Fanyi Wang; Tianyi Lu; Peng Liu; Jingwen Su; Jinxiu Liu,; Yanhao Zhang; Zuxuan Wu; Guo-Jun Qi; Yu-Gang Jiang

arXiv:2404.13680·cs.CV·June 6, 2024·2 cites

Zero-shot High-fidelity and Pose-controllable Character Animation

Bingwen Zhu, Fanyi Wang, Tianyi Lu, Peng Liu, Jingwen Su, Jinxiu Liu,, Yanhao Zhang, Zuxuan Wu, Guo-Jun Qi, Yu-Gang Jiang

PDF

Open Access

TL;DR

This paper introduces PoseAnimate, a zero-shot framework for character animation from a single image, achieving high fidelity, temporal coherence, and pose control without extensive training data.

Contribution

The paper presents PoseAnimate, a novel zero-shot I2V method with three modules and a transition algorithm, improving character consistency and detail preservation over existing approaches.

Findings

01

Outperforms state-of-the-art methods in character consistency

02

Maintains high temporal coherence in generated videos

03

Effectively preserves fine details and background fidelity

Abstract

Image-to-video (I2V) generation aims to create a video sequence from a single image, which requires high temporal coherence and visual fidelity. However, existing approaches suffer from inconsistency of character appearances and poor preservation of fine details. Moreover, they require a large amount of video data for training, which can be computationally demanding. To address these limitations, we propose PoseAnimate, a novel zero-shot I2V framework for character animation. PoseAnimate contains three key components: 1) a Pose-Aware Control Module (PACM) that incorporates diverse pose signals into text embeddings, to preserve character-independent content and maintain precise alignment of actions. 2) a Dual Consistency Attention Module (DCAM) that enhances temporal consistency and retains character identity and intricate background details. 3) a Mask-Guided Decoupling Module (MGDM)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · 3D Shape Modeling and Analysis · Human Pose and Action Recognition