Neural Human Video Rendering by Learning Dynamic Textures and Rendering-to-Video Translation
Lingjie Liu, Weipeng Xu, Marc Habermann, Michael Zollhoefer, Florian, Bernard, Hyeongwoo Kim, Wenping Wang, Christian Theobalt

TL;DR
This paper introduces a neural human video synthesis method that explicitly models dynamic textures and uses a dual CNN approach to improve temporal coherence and detail preservation in generated videos.
Contribution
It proposes a novel approach that disentangles fine-scale detail learning from 2D embedding, enhancing realism and temporal stability in human video synthesis.
Findings
Significant improvement over state-of-the-art in qualitative quality
Quantitative metrics show better detail and temporal consistency
Effective for human reenactment and view synthesis from monocular videos
Abstract
Synthesizing realistic videos of humans using neural networks has been a popular alternative to the conventional graphics-based rendering pipeline due to its high efficiency. Existing works typically formulate this as an image-to-image translation problem in 2D screen space, which leads to artifacts such as over-smoothing, missing body parts, and temporal instability of fine-scale detail, such as pose-dependent wrinkles in the clothing. In this paper, we propose a novel human video synthesis method that approaches these limiting factors by explicitly disentangling the learning of time-coherent fine-scale details from the embedding of the human in 2D screen space. More specifically, our method relies on the combination of two convolutional neural networks (CNNs). Given the pose information, the first CNN predicts a dynamic texture map that contains time-coherent high-frequency details,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis · Computer Graphics and Visualization Techniques
