WildGHand: Learning Anti-Perturbation Gaussian Hand Avatars from Monocular In-the-Wild Videos

Hanhui Li; Xuan Huang; Wanquan Liu; Yuhao Cheng; Long Chen; Yiqiang Yan; Xiaodan Liang; Chenqiang Gao

arXiv:2602.20556·cs.CV·February 25, 2026

WildGHand: Learning Anti-Perturbation Gaussian Hand Avatars from Monocular In-the-Wild Videos

Hanhui Li, Xuan Huang, Wanquan Liu, Yuhao Cheng, Long Chen, Yiqiang Yan, Xiaodan Liang, Chenqiang Gao

PDF

Open Access

TL;DR

WildGHand introduces a novel optimization framework for high-fidelity 3D hand avatar reconstruction from monocular in-the-wild videos, effectively handling severe perturbations like occlusions, extreme poses, and motion blur.

Contribution

The paper proposes WildGHand, a new perturbation-aware optimization method with a dynamic disentanglement module for robust 3D hand avatar reconstruction in challenging real-world conditions.

Findings

01

Achieves up to 15.8% improvement in PSNR.

02

Reduces LPIPS by 23.1%.

03

Outperforms existing methods on curated in-the-wild datasets.

Abstract

Despite recent progress in 3D hand reconstruction from monocular videos, most existing methods rely on data captured in well-controlled environments and therefore degrade in real-world settings with severe perturbations, such as hand-object interactions, extreme poses, illumination changes, and motion blur. To tackle these issues, we introduce WildGHand, an optimization-based framework that enables self-adaptive 3D Gaussian splatting on in-the-wild videos and produces high-fidelity hand avatars. WildGHand incorporates two key components: (i) a dynamic perturbation disentanglement module that explicitly represents perturbations as time-varying biases on 3D Gaussian attributes during optimization, and (ii) a perturbation-aware optimization strategy that generates per-frame anisotropic weighted masks to guide optimization. Together, these components allow the framework to identify and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Generative Adversarial Networks and Image Synthesis · Robot Manipulation and Learning