Neural Human Video Rendering by Learning Dynamic Textures and   Rendering-to-Video Translation

Lingjie Liu; Weipeng Xu; Marc Habermann; Michael Zollhoefer; Florian; Bernard; Hyeongwoo Kim; Wenping Wang; Christian Theobalt

arXiv:2001.04947·cs.GR·July 7, 2021·49 cites

Neural Human Video Rendering by Learning Dynamic Textures and Rendering-to-Video Translation

Lingjie Liu, Weipeng Xu, Marc Habermann, Michael Zollhoefer, Florian, Bernard, Hyeongwoo Kim, Wenping Wang, Christian Theobalt

PDF

Open Access

TL;DR

This paper introduces a neural human video synthesis method that explicitly models dynamic textures and uses a dual CNN approach to improve temporal coherence and detail preservation in generated videos.

Contribution

It proposes a novel approach that disentangles fine-scale detail learning from 2D embedding, enhancing realism and temporal stability in human video synthesis.

Findings

01

Significant improvement over state-of-the-art in qualitative quality

02

Quantitative metrics show better detail and temporal consistency

03

Effective for human reenactment and view synthesis from monocular videos

Abstract

Synthesizing realistic videos of humans using neural networks has been a popular alternative to the conventional graphics-based rendering pipeline due to its high efficiency. Existing works typically formulate this as an image-to-image translation problem in 2D screen space, which leads to artifacts such as over-smoothing, missing body parts, and temporal instability of fine-scale detail, such as pose-dependent wrinkles in the clothing. In this paper, we propose a novel human video synthesis method that approaches these limiting factors by explicitly disentangling the learning of time-coherent fine-scale details from the embedding of the human in 2D screen space. More specifically, our method relies on the combination of two convolutional neural networks (CNNs). Given the pose information, the first CNN predicts a dynamic texture map that contains time-coherent high-frequency details,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques