FlexAvatar: Learning Complete 3D Head Avatars with Partial Supervision

Tobias Kirschstein; Simon Giebenhain; Matthias Nie{\ss}ner

arXiv:2512.15599·cs.CV·April 9, 2026

FlexAvatar: Learning Complete 3D Head Avatars with Partial Supervision

Tobias Kirschstein, Simon Giebenhain, Matthias Nie{\ss}ner

PDF

1 Repo

TL;DR

FlexAvatar is a transformer-based method that creates complete 3D head avatars from a single image by effectively combining monocular and multi-view data during training.

Contribution

It introduces learnable data source tokens enabling unified training on monocular and multi-view datasets for high-quality 3D head avatar reconstruction.

Findings

01

Achieves full 3D head reconstructions from monocular images.

02

Enables identity interpolation and flexible fitting.

03

Outperforms existing methods in view extrapolation and facial animation.

Abstract

We introduce FlexAvatar, a method for creating high-quality and complete 3D head avatars from a single image. A core challenge lies in the limited availability of multi-view data and the tendency of monocular training to yield incomplete 3D head reconstructions. We identify the root cause of this issue as the entanglement between driving signal and target viewpoint when learning from monocular videos. To address this, we propose a transformer-based 3D portrait animation model with learnable data source tokens, so-called bias sinks, which enables unified training across monocular and multi-view datasets. This design leverages the strengths of both data sources during inference: strong generalization from monocular data and full 3D completeness from multi-view supervision. Furthermore, our training procedure yields a smooth latent avatar space that facilitates identity interpolation and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://tobias-kirschstein.github.io/flexavatar
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.