Scaling Neural Face Synthesis to High FPS and Low Latency by Neural Caching
Frank Yu, Sid Fels, Helge Rhodin

TL;DR
This paper introduces a neural caching method that significantly reduces latency in neural face synthesis, enabling high frame rate, low-latency telepresence applications with minimal quality loss.
Contribution
It proposes a novel caching and warping technique that breaks layer dependency in neural networks, drastically reducing latency for real-time face rendering.
Findings
Latency reduced by 70% on commodity GPUs
Frame rate scaled across multiple GPUs
Image quality decreased by only 1%
Abstract
Recent neural rendering approaches greatly improve image quality, reaching near photorealism. However, the underlying neural networks have high runtime, precluding telepresence and virtual reality applications that require high resolution at low latency. The sequential dependency of layers in deep networks makes their optimization difficult. We break this dependency by caching information from the previous frame to speed up the processing of the current one with an implicit warp. The warping with a shallow network reduces latency and the caching operations can further be parallelized to improve the frame rate. In contrast to existing temporal neural networks, ours is tailored for the task of rendering novel views of faces by conditioning on the change of the underlying surface mesh. We test the approach on view-dependent rendering of 3D portrait avatars, as needed for telepresence, on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Scaling Neural Face Synthesis to High FPS and Low Latency by Neural Caching· youtube
Taxonomy
TopicsFace recognition and analysis · Advanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis
MethodsTest · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
