Retrieval-Augmented Gaussian Avatars: Improving Expression Generalization
Matan Levy, Gavriel Habib, Issar Tzachor, Dvir Samuel, Rami Ben-Ari, Nir Darshan, Or Litany, Dani Lischinski

TL;DR
This paper introduces RAF, a retrieval-augmented training method for template-free head avatars, which enhances expression generalization and robustness by leveraging a large expression bank during training.
Contribution
The paper proposes a novel retrieval-augmented training approach that improves expression coverage and identity-expression decoupling without extra annotations or architectural changes.
Findings
RAF improves expression fidelity on the NeRSemble benchmark.
Retrieval augmentation increases expression diversity and robustness.
User study confirms retrieved expressions are perceptually similar.
Abstract
Template-free animatable head avatars can achieve high visual fidelity by learning expression-dependent facial deformation directly from a subject's capture, avoiding parametric face templates and hand-designed blendshape spaces. However, since learned deformation is supervised only by the expressions observed for a single identity, these models suffer from limited expression coverage and often struggle when driven by motions that deviate from the training distribution. We introduce RAF (Retrieval-Augmented Faces), a simple training-time augmentation designed for template-free head avatars that learn deformation from data. RAF constructs a large unlabeled expression bank and, during training, replaces a subset of the subject's expression features with nearest-neighbor expressions retrieved from this bank while still reconstructing the subject's original frames. This exposes the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Emotion and Mood Recognition · Social Robot Interaction and HRI
