High-Fidelity Mobile Avatars with Pruned Local Blendshapes

Youyi Zhan; He Wang; Tianjia Shao; Kun Zhou

arXiv:2605.01854·cs.CV·May 5, 2026

High-Fidelity Mobile Avatars with Pruned Local Blendshapes

Youyi Zhan, He Wang, Tianjia Shao, Kun Zhou

PDF

TL;DR

This paper introduces a mobile-friendly method for high-fidelity human avatar reconstruction from multi-view video, emphasizing local linear blendshapes to balance detail and computational efficiency.

Contribution

It presents a novel local linear blendshape approach that reduces computation and model size while maintaining high avatar detail on mobile devices.

Findings

01

Achieves 120 FPS at 2K resolution on mobile devices.

02

Produces high-quality avatars with better details than previous methods.

03

Uses WebGPU for cross-device implementation.

Abstract

We propose a method to reconstruct high-fidelity human avatars from multi-view video that can run on mobile devices. Many works can model high-quality Gaussian-based full-body avatars from multi-view video. However, these methods require heavy computation to obtain pose-dependent appearance, making deployment on mobile devices very difficult. Recent methods distill from pretrained models and model pose-dependent nonlinear Gaussian attributes by linearly combining global pose features with blendshapes. Although they can run on mobile devices, they suffer some loss of detail. We observe that nearby Gaussians are often highly correlated within a local region of the body, and can be linearly modeled with less error. Therefore, we use local linear blendshapes in small body parts to capture global nonlinear changes of Gaussian attributes. To further reduce computation and model size, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.