Attention Mesh: High-fidelity Face Mesh Prediction in Real-time
Ivan Grishchenko, Artsiom Ablavatski, Yury Kartynnik, Karthik, Raveendran, Matthias Grundmann

TL;DR
Attention Mesh is a fast, lightweight neural network architecture for real-time 3D face mesh prediction on mobile devices, enabling high-accuracy facial landmark applications like AR makeup and eye tracking.
Contribution
It introduces a unified, attention-based network architecture that matches the accuracy of multi-stage methods while being 30% faster for real-time face mesh prediction.
Findings
Runs at over 50 FPS on a Pixel 2 phone
Achieves high accuracy comparable to multi-stage approaches
Enables real-time AR applications
Abstract
We present Attention Mesh, a lightweight architecture for 3D face mesh prediction that uses attention to semantically meaningful regions. Our neural network is designed for real-time on-device inference and runs at over 50 FPS on a Pixel 2 phone. Our solution enables applications like AR makeup, eye tracking and AR puppeteering that rely on highly accurate landmarks for eye and lips regions. Our main contribution is a unified network architecture that achieves the same accuracy on facial landmarks as a multi-stage cascaded approach, while being 30 percent faster.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis
MethodsGemini Customer Care Number +1-888-829-0881
