TL;DR
This paper introduces a prior-guided learning framework for egocentric whole-body human mesh recovery from monocular head-mounted camera images, improving accuracy by leveraging multiple priors and addressing fisheye distortions.
Contribution
It proposes a novel framework that reconstructs detailed whole-body meshes from a single egocentric image, utilizing optimized pseudo-GT and multiple priors including an exocentric HMR model and diffusion-based pose prior.
Findings
Achieves superior whole-body reconstruction accuracy on egocentric benchmarks.
Demonstrates pseudo-GT is more accurate than existing regression-based pseudo-GT.
Effectively handles fisheye distortions with a deterministic undistortion module.
Abstract
Egocentric human mesh recovery (HMR) from monocular head-mounted cameras is increasingly important for AR/VR applications, but remains challenging due to the lack of reliable ground-truth (GT) annotations based on parametric human body models such as SMPL and SMPL-X for real egocentric images. Existing egocentric HMR methods typically rely on pseudo-GT and focus on body pose estimation, which limits their ability to recover fine-grained whole-body details such as hands and face. We study egocentric whole-body human mesh recovery and propose a prior-guided learning framework that reconstructs whole-body meshes from a single egocentric image. We construct more accurate optimization-based pseudo-GT aligned with 3D joint supervision, and leverage multiple priors by adapting an exocentric HMR foundation model together with a diffusion-based pose prior. A deterministic undistortion module is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
