Scaling Neural Face Synthesis to High FPS and Low Latency by Neural   Caching

Frank Yu; Sid Fels; Helge Rhodin

arXiv:2211.05773·cs.CV·November 11, 2022

Scaling Neural Face Synthesis to High FPS and Low Latency by Neural Caching

Frank Yu, Sid Fels, Helge Rhodin

PDF

Open Access 1 Video

TL;DR

This paper introduces a neural caching method that significantly reduces latency in neural face synthesis, enabling high frame rate, low-latency telepresence applications with minimal quality loss.

Contribution

It proposes a novel caching and warping technique that breaks layer dependency in neural networks, drastically reducing latency for real-time face rendering.

Findings

01

Latency reduced by 70% on commodity GPUs

02

Frame rate scaled across multiple GPUs

03

Image quality decreased by only 1%

Abstract

Recent neural rendering approaches greatly improve image quality, reaching near photorealism. However, the underlying neural networks have high runtime, precluding telepresence and virtual reality applications that require high resolution at low latency. The sequential dependency of layers in deep networks makes their optimization difficult. We break this dependency by caching information from the previous frame to speed up the processing of the current one with an implicit warp. The warping with a shallow network reduces latency and the caching operations can further be parallelized to improve the frame rate. In contrast to existing temporal neural networks, ours is tailored for the task of rendering novel views of faces by conditioning on the change of the underlying surface mesh. We test the approach on view-dependent rendering of 3D portrait avatars, as needed for telepresence, on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Scaling Neural Face Synthesis to High FPS and Low Latency by Neural Caching· youtube

Taxonomy

TopicsFace recognition and analysis · Advanced Vision and Imaging · Generative Adversarial Networks and Image Synthesis

MethodsTest · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings