Neural Descent for Visual 3D Human Pose and Shape

Andrei Zanfir; Eduard Gabriel Bazavan; Mihai Zanfir; William T.; Freeman; Rahul Sukthankar; Cristian Sminchisescu

arXiv:2008.06910·cs.CV·June 15, 2021

Neural Descent for Visual 3D Human Pose and Shape

Andrei Zanfir, Eduard Gabriel Bazavan, Mihai Zanfir, William T., Freeman, Rahul Sukthankar, Cristian Sminchisescu

PDF

TL;DR

This paper introduces HUND, a neural descent approach for reconstructing 3D human pose and shape from RGB images, leveraging a statistical model and self-supervised learning to improve efficiency and versatility.

Contribution

It proposes a novel neural descent method that avoids second-order derivatives and expensive optimization, enabling flexible and self-supervised 3D human reconstruction.

Findings

01

Achieves competitive results on H3.6M and 3DPW datasets.

02

Supports different operating regimes including self-supervised learning.

03

Produces high-quality 3D reconstructions in diverse, in-the-wild images.

Abstract

We present deep neural network methodology to reconstruct the 3d pose and shape of people, given an input RGB image. We rely on a recently introduced, expressivefull body statistical 3d human model, GHUM, trained end-to-end, and learn to reconstruct its pose and shape state in a self-supervised regime. Central to our methodology, is a learning to learn and optimize approach, referred to as HUmanNeural Descent (HUND), which avoids both second-order differentiation when training the model parameters,and expensive state gradient descent in order to accurately minimize a semantic differentiable rendering loss at test time. Instead, we rely on novel recurrent stages to update the pose and shape parameters such that not only losses are minimized effectively, but the process is meta-regularized in order to ensure end-progress. HUND's symmetry between training and testing makes it the first 3d…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.