Free-viewpoint Human Animation with Pose-correlated Reference Selection

Fa-Ting Hong; Zhan Xu; Haiyang Liu; Qinjie Lin; Luchuan Song; Zhixin; Shu; Yang Zhou; Duygu Ceylan; Dan Xu

arXiv:2412.17290·cs.CV·December 30, 2024

Free-viewpoint Human Animation with Pose-correlated Reference Selection

Fa-Ting Hong, Zhan Xu, Haiyang Liu, Qinjie Lin, Luchuan Song, Zhixin, Shu, Yang Zhou, Duygu Ceylan, Dan Xu

PDF

Open Access

TL;DR

This paper introduces a diffusion-based human animation method that effectively handles large viewpoint changes by using multiple reference images and an adaptive selection strategy, improving realism and flexibility.

Contribution

The paper proposes a pose-correlated reference selection diffusion network with an innovative pose correlation module and adaptive reference strategy for enhanced viewpoint variation handling.

Findings

01

Outperforms state-of-the-art methods under large viewpoint changes.

02

Utilizes multiple references to preserve appearance details across views.

03

Adaptive reference selection improves animation quality in free viewpoints.

Abstract

Diffusion-based human animation aims to animate a human character based on a source human image as well as driving signals such as a sequence of poses. Leveraging the generative capacity of diffusion model, existing approaches are able to generate high-fidelity poses, but struggle with significant viewpoint changes, especially in zoom-in/zoom-out scenarios where camera-character distance varies. This limits the applications such as cinematic shot type plan or camera control. We propose a pose-correlated reference selection diffusion network, supporting substantial viewpoint variations in human animation. Our key idea is to enable the network to utilize multiple reference images as input, since significant viewpoint changes often lead to missing appearance details on the human body. To eliminate the computational cost, we first introduce a novel pose correlation module to compute…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Advanced Vision and Imaging · 3D Shape Modeling and Analysis

MethodsSoftmax · Attention Is All You Need · Diffusion