Expressive Telepresence via Modular Codec Avatars
Hang Chu, Shugao Ma, Fernando De la Torre, Sanja Fidler, Yaser Sheikh

TL;DR
This paper introduces Modular Codec Avatars (MCA), a novel approach for creating hyper-realistic, expressive, and robust VR avatars by modularly blending facial components, advancing telepresence technology.
Contribution
MCA extends traditional Codec Avatars with a modular, learned representation that enhances expressiveness and robustness in VR telepresence applications.
Findings
MCA outperforms traditional CAs in expressiveness and robustness.
MCA demonstrates improved performance across real-world datasets.
New VR telepresence applications enabled by MCA.
Abstract
VR telepresence consists of interacting with another human in a virtual space represented by an avatar. Today most avatars are cartoon-like, but soon the technology will allow video-realistic ones. This paper aims in this direction and presents Modular Codec Avatars (MCA), a method to generate hyper-realistic faces driven by the cameras in the VR headset. MCA extends traditional Codec Avatars (CA) by replacing the holistic models with a learned modular representation. It is important to note that traditional person-specific CAs are learned from few training samples, and typically lack robustness as well as limited expressiveness when transferring facial expressions. MCAs solve these issues by learning a modulated adaptive blending of different facial components as well as an exemplar-based latent alignment. We demonstrate that MCA achieves improved expressiveness and robustness w.r.t to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Speech and Audio Processing
