GAT-NeRF: Geometry-Aware-Transformer Enhanced Neural Radiance Fields for High-Fidelity 4D Facial Avatars
Zhe Chang, Haodong Jin, Ying Sun, Yan Song, Hui Yu

TL;DR
GAT-NeRF is a novel hybrid neural radiance field framework that integrates a Transformer mechanism with NeRF to enhance high-fidelity 4D facial avatar reconstruction, capturing fine facial details from monocular videos.
Contribution
This work introduces GAT-NeRF, combining a coordinate-aligned MLP with a Geometry-Aware-Transformer to effectively model complex facial details using multi-modal geometric priors.
Findings
Achieves state-of-the-art visual fidelity in 4D facial avatars.
Effectively captures high-frequency facial details like wrinkles and scars.
Demonstrates superior performance over existing methods in experiments.
Abstract
High-fidelity 4D dynamic facial avatar reconstruction from monocular video is a critical yet challenging task, driven by increasing demands for immersive virtual human applications. While Neural Radiance Fields (NeRF) have advanced scene representation, their capacity to capture high-frequency facial details, such as dynamic wrinkles and subtle textures from information-constrained monocular streams, requires significant enhancement. To tackle this challenge, we propose a novel hybrid neural radiance field framework, called Geometry-Aware-Transformer Enhanced NeRF (GAT-NeRF) for high-fidelity and controllable 4D facial avatar reconstruction, which integrates the Transformer mechanism into the NeRF pipeline. GAT-NeRF synergistically combines a coordinate-aligned Multilayer Perceptron (MLP) with a lightweight Transformer module, termed as Geometry-Aware-Transformer (GAT) due to its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Facial Nerve Paralysis Treatment and Research
