Feed-forward Gaussian Registration for Head Avatar Creation and Editing
Malte Prinzler, Paulo Gotardo, Siyu Tang, Timo Bolkart

TL;DR
MATCH is a fast, end-to-end Gaussian registration method that enables high-quality head avatar creation and editing from multi-view images in just 0.5 seconds per frame, significantly reducing previous processing time.
Contribution
The paper introduces a novel transformer-based Gaussian registration approach with a registration-guided attention block for efficient, correspondence-based head avatar creation and editing.
Findings
MATCH outperforms existing methods in view synthesis and registration.
Avatar creation is accelerated by a factor of 10.
The method enables applications like expression transfer and semantic editing.
Abstract
We present MATCH (Multi-view Avatars from Topologically Corresponding Heads), a multi-view Gaussian registration method for high-quality head avatar creation and editing. State-of-the-art multi-view head avatar methods require time-consuming head tracking followed by expensive avatar optimization, often resulting in a total creation time of more than one day. MATCH, in contrast, directly predicts Gaussian splat textures in correspondence from calibrated multi-view images in just 0.5 seconds per frame, without requiring data preprocessing. The learned intra-subject correspondence across frames enables fast creation of personalized head avatars, while correspondence across subjects supports applications such as expression transfer, optimization-free tracking, semantic editing, and identity interpolation. We establish these correspondences end-to-end using a transformer-based model that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Social Robot Interaction and HRI
