Expressive Telepresence via Modular Codec Avatars

Hang Chu; Shugao Ma; Fernando De la Torre; Sanja Fidler; Yaser Sheikh

arXiv:2008.11789 · cs.CV · August 28, 2020 · 5 cites

TL;DR

This paper introduces Modular Codec Avatars (MCA), an approach for creating hyper-realistic, expressive, and robust VR avatars by adaptively blending modular facial components.

Contribution

MCA extends traditional Codec Avatars with a modular, learned representation that enhances expressiveness and robustness in VR telepresence applications.

Findings

01

MCA outperforms traditional Codec Avatars (CAs) in expressiveness and robustness.

02

MCA demonstrates improved performance across real-world datasets.

03

MCA enables new VR telepresence applications.

Abstract

VR telepresence consists of interacting with another human who is represented by an avatar in a virtual space. Today most avatars are cartoon-like, but soon the technology will allow video-realistic ones. This paper works toward that goal and presents Modular Codec Avatars (MCA), a method to generate hyper-realistic faces driven by the cameras in the VR headset. MCA extends traditional Codec Avatars (CA) by replacing the holistic models with a learned modular representation. Traditional person-specific CAs are learned from few training samples, and typically lack robustness and have limited expressiveness when transferring facial expressions. MCAs solve these issues by learning a modulated adaptive blending of different facial components as well as an exemplar-based latent alignment. We demonstrate that MCA achieves improved expressiveness and robustness with respect to…
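The abstract does not spell out how the adaptive blending of facial components is computed. As a rough illustration only, the sketch below shows one generic way to blend several per-module predictions with per-vertex weights that sum to one. It is not the paper's architecture: the function names `softmax` and `blend_modules`, the list-based data layout, and the use of a softmax over raw scores are all assumptions made for this example.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def blend_modules(module_outputs, module_scores):
    """Blend per-module predictions vertex by vertex.

    module_outputs: one list of per-vertex values per module (hypothetical layout).
    module_scores:  one list of per-vertex raw blending scores per module
                    (hypothetical; a learned system would predict these).
    """
    n_vertices = len(module_outputs[0])
    blended = []
    for v in range(n_vertices):
        # Normalize the scores of all modules at this vertex into weights.
        weights = softmax([scores[v] for scores in module_scores])
        # Weighted sum of the modules' predictions at this vertex.
        blended.append(sum(w * out[v] for w, out in zip(weights, module_outputs)))
    return blended


# Two hypothetical modules, each predicting a value for two vertices.
outputs = [[1.0, 1.0], [3.0, 3.0]]
# Equal scores at vertex 0; module 0 strongly preferred at vertex 1.
scores = [[0.0, 10.0], [0.0, -10.0]]
print(blend_modules(outputs, scores))
```

With equal scores the result at a vertex is the plain average of the modules, and as one module's score dominates the result approaches that module's prediction. This is the sense in which the weights make the blend adaptive in this sketch.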

Taxonomy

Topics: Face recognition and analysis · Generative Adversarial Networks and Image Synthesis · Speech and Audio Processing