GenHMR: Generative Human Mesh Recovery
Muhammad Usama Saleem, Ekkasit Pinyoanuntapong, Pu Wang, Hongfei Xue,, Srijan Das, Chen Chen

TL;DR
GenHMR introduces a probabilistic, generative approach to monocular human mesh recovery, explicitly modeling uncertainties and improving accuracy over existing deterministic methods.
Contribution
It presents a novel generative framework with a pose tokenizer and masked transformer, enhancing 3D human pose estimation from single images by modeling uncertainties.
Findings
Outperforms state-of-the-art methods on benchmark datasets
Effectively reduces 3D reconstruction uncertainties
Incorporates 2D pose-guided refinement for better accuracy
Abstract
Human mesh recovery (HMR) is crucial in many computer vision applications; from health to arts and entertainment. HMR from monocular images has predominantly been addressed by deterministic methods that output a single prediction for a given 2D image. However, HMR from a single image is an ill-posed problem due to depth ambiguity and occlusions. Probabilistic methods have attempted to address this by generating and fusing multiple plausible 3D reconstructions, but their performance has often lagged behind deterministic approaches. In this paper, we introduce GenHMR, a novel generative framework that reformulates monocular HMR as an image-conditioned generative task, explicitly modeling and mitigating uncertainties in the 2D-to-3D mapping process. GenHMR comprises two key components: (1) a pose tokenizer to convert 3D human poses into a sequence of discrete tokens in a latent space, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsForensic Entomology and Diptera Studies · Textile materials and evaluations
MethodsALIGN
