LAM: Large Avatar Model for One-shot Animatable Gaussian Head
Yisheng He, Xiaodong Gu, Xiaodan Ye, Chao Xu, Zhengyi Zhao, Yuan Dong,, Weihao Yuan, Zilong Dong, Liefeng Bo

TL;DR
LAM introduces a single-pass, animatable Gaussian head model from a single image that enables real-time reenactment and rendering without additional training or neural networks, suitable for diverse platforms.
Contribution
The paper presents a novel approach to generate immediately animatable Gaussian heads from a single image using a Transformer-based attribute generator, eliminating the need for extensive training or auxiliary networks.
Findings
LAM outperforms existing methods on benchmarks.
Enables real-time animation on mobile devices.
Provides seamless integration into rendering pipelines.
Abstract
We present LAM, an innovative Large Avatar Model for animatable Gaussian head reconstruction from a single image. Unlike previous methods that require extensive training on captured video sequences or rely on auxiliary neural networks for animation and rendering during inference, our approach generates Gaussian heads that are immediately animatable and renderable. Specifically, LAM creates an animatable Gaussian head in a single forward pass, enabling reenactment and rendering without additional networks or post-processing steps. This capability allows for seamless integration into existing rendering pipelines, ensuring real-time animation and rendering across a wide range of platforms, including mobile phones. The centerpiece of our framework is the canonical Gaussian attributes generator, which utilizes FLAME canonical points as queries. These points interact with multi-scale image…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Human Motion and Animation
MethodsAttention Is All You Need · Absolute Position Encodings · Dense Connections · Linear Layer · Layer Normalization · Byte Pair Encoding · Residual Connection · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer
