LAM: Large Avatar Model for One-shot Animatable Gaussian Head

Yisheng He; Xiaodong Gu; Xiaodan Ye; Chao Xu; Zhengyi Zhao; Yuan Dong,; Weihao Yuan; Zilong Dong; Liefeng Bo

arXiv:2502.17796·cs.CV·April 7, 2025

LAM: Large Avatar Model for One-shot Animatable Gaussian Head

Yisheng He, Xiaodong Gu, Xiaodan Ye, Chao Xu, Zhengyi Zhao, Yuan Dong,, Weihao Yuan, Zilong Dong, Liefeng Bo

PDF

Open Access 2 Models

TL;DR

LAM introduces a single-pass, animatable Gaussian head model from a single image that enables real-time reenactment and rendering without additional training or neural networks, suitable for diverse platforms.

Contribution

The paper presents a novel approach to generate immediately animatable Gaussian heads from a single image using a Transformer-based attribute generator, eliminating the need for extensive training or auxiliary networks.

Findings

01

LAM outperforms existing methods on benchmarks.

02

Enables real-time animation on mobile devices.

03

Provides seamless integration into rendering pipelines.

Abstract

We present LAM, an innovative Large Avatar Model for animatable Gaussian head reconstruction from a single image. Unlike previous methods that require extensive training on captured video sequences or rely on auxiliary neural networks for animation and rendering during inference, our approach generates Gaussian heads that are immediately animatable and renderable. Specifically, LAM creates an animatable Gaussian head in a single forward pass, enabling reenactment and rendering without additional networks or post-processing steps. This capability allows for seamless integration into existing rendering pipelines, ensuring real-time animation and rendering across a wide range of platforms, including mobile phones. The centerpiece of our framework is the canonical Gaussian attributes generator, which utilizes FLAME canonical points as queries. These points interact with multi-scale image…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Human Motion and Animation

MethodsAttention Is All You Need · Absolute Position Encodings · Dense Connections · Linear Layer · Layer Normalization · Byte Pair Encoding · Residual Connection · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer