Robust Pose Transfer with Dynamic Details using Neural Video Rendering

Yang-tian Sun; Hao-zhi Huang; Xuan Wang; Yu-kun Lai; Wei Liu; Lin Gao

arXiv:2106.14132·cs.CV·May 9, 2023

Robust Pose Transfer with Dynamic Details using Neural Video Rendering

Yang-tian Sun, Hao-zhi Huang, Xuan Wang, Yu-kun Lai, Wei Liu, Lin Gao

PDF

Open Access

TL;DR

This paper introduces a neural video rendering framework with a novel texture representation and temporal loss, enabling high-quality pose transfer with dynamic details from short monocular videos, outperforming existing methods.

Contribution

It proposes a new neural rendering approach combining explicit 3D features and learned components, with a texture representation and temporal loss to improve detail preservation and stability.

Findings

01

Achieves clearer dynamic details in pose transfer videos.

02

Performs robustly on short videos with only 2k-4k frames.

03

Outperforms existing methods in detail quality and stability.

Abstract

Pose transfer of human videos aims to generate a high fidelity video of a target person imitating actions of a source person. A few studies have made great progress either through image translation with deep latent features or neural rendering with explicit 3D features. However, both of them rely on large amounts of training data to generate realistic results, and the performance degrades on more accessible internet videos due to insufficient training frames. In this paper, we demonstrate that the dynamic details can be preserved even trained from short monocular videos. Overall, we propose a neural video rendering framework coupled with an image-translation-based dynamic details generation network (D2G-Net), which fully utilizes both the stability of explicit 3D features and the capacity of learning components. To be specific, a novel texture representation is presented to encode both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Advanced Image Processing Techniques