Robust Pose Transfer with Dynamic Details using Neural Video Rendering
Yang-tian Sun, Hao-zhi Huang, Xuan Wang, Yu-kun Lai, Wei Liu, Lin Gao

TL;DR
This paper introduces a neural video rendering framework with a novel texture representation and temporal loss, enabling high-quality pose transfer with dynamic details from short monocular videos, outperforming existing methods.
Contribution
It proposes a new neural rendering approach combining explicit 3D features and learned components, with a texture representation and temporal loss to improve detail preservation and stability.
Findings
Achieves clearer dynamic details in pose transfer videos.
Performs robustly on short videos with only 2k-4k frames.
Outperforms existing methods in detail quality and stability.
Abstract
Pose transfer of human videos aims to generate a high fidelity video of a target person imitating actions of a source person. A few studies have made great progress either through image translation with deep latent features or neural rendering with explicit 3D features. However, both of them rely on large amounts of training data to generate realistic results, and the performance degrades on more accessible internet videos due to insufficient training frames. In this paper, we demonstrate that the dynamic details can be preserved even trained from short monocular videos. Overall, we propose a neural video rendering framework coupled with an image-translation-based dynamic details generation network (D2G-Net), which fully utilizes both the stability of explicit 3D features and the capacity of learning components. To be specific, a novel texture representation is presented to encode both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Advanced Image Processing Techniques
