Monocular Human Shape and Pose with Dense Mesh-borne Local Image   Features

Shubhendu Jena; Franck Multon; Adnane Boukhayma

arXiv:2111.05319·cs.CV·August 12, 2022

Monocular Human Shape and Pose with Dense Mesh-borne Local Image Features

Shubhendu Jena, Franck Multon, Adnane Boukhayma

PDF

Open Access

TL;DR

This paper introduces a novel approach for human shape and pose estimation from monocular images by utilizing dense, vertex-specific local image features instead of global features, leading to improved accuracy.

Contribution

It is the first to incorporate pixel-aligned local image features per mesh vertex using DensePose, enhancing graph convolutional network performance for human shape and pose estimation.

Findings

01

Local features outperform global features in accuracy.

02

The method achieves competitive results on standard benchmarks.

03

Using dense local features improves pose and shape estimation quality.

Abstract

We propose to improve on graph convolution based approaches for human shape and pose estimation from monocular input, using pixel-aligned local image features. Given a single input color image, existing graph convolutional network (GCN) based techniques for human shape and pose estimation use a single convolutional neural network (CNN) generated global image feature appended to all mesh vertices equally to initialize the GCN stage, which transforms a template T-posed mesh into the target pose. In contrast, we propose for the first time the idea of using local image features per vertex. These features are sampled from the CNN image feature maps by utilizing pixel-to-mesh correspondences generated with DensePose. Our quantitative and qualitative results on standard benchmarks show that using local features improves on global ones and leads to competitive performances with respect to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · 3D Shape Modeling and Analysis · Robot Manipulation and Learning

MethodsGraph Convolutional Network · Convolution