TL;DR
This paper introduces a hierarchical probabilistic model for 3D human shape and pose estimation from images, capturing uncertainty and multiple plausible reconstructions by leveraging structured distributions and a differentiable sampling approach.
Contribution
It proposes a novel hierarchical distribution framework for 3D human pose and shape, incorporating a differentiable rejection sampler for improved image consistency.
Findings
Competitive accuracy on SSP-3D and 3DPW datasets
Provides meaningful quantification of prediction uncertainty
Enables sampling of multiple plausible 3D reconstructions
Abstract
This paper addresses the problem of 3D human body shape and pose estimation from an RGB image. This is often an ill-posed problem, since multiple plausible 3D bodies may match the visual evidence present in the input - particularly when the subject is occluded. Thus, it is desirable to estimate a distribution over 3D body shape and pose conditioned on the input image instead of a single 3D reconstruction. We train a deep neural network to estimate a hierarchical matrix-Fisher distribution over relative 3D joint rotation matrices (i.e. body pose), which exploits the human body's kinematic tree structure, as well as a Gaussian distribution over SMPL body shape parameters. To further ensure that the predicted shape and pose distributions match the visual evidence in the input image, we implement a differentiable rejection sampler to impose a reprojection loss between ground-truth 2D joint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
