Human Pose Regression by Combining Indirect Part Detection and   Contextual Information

Diogo C. Luvizon; Hedi Tabia; David Picard

arXiv:1710.02322·cs.CV·October 9, 2017

Human Pose Regression by Combining Indirect Part Detection and Contextual Information

Diogo C. Luvizon, Hedi Tabia, David Picard

PDF

1 Repo

TL;DR

This paper introduces a fully differentiable end-to-end human pose regression method that leverages indirect heat map learning and contextual information, achieving state-of-the-art results on challenging datasets.

Contribution

It presents a novel Soft-argmax based regression framework that learns heat maps indirectly without extra ground truth, enabling seamless integration of contextual cues.

Findings

01

Achieved top performance on LSP and MPII datasets.

02

Outperformed existing regression methods in human pose estimation.

03

Comparable to detection-based approaches in accuracy.

Abstract

In this paper, we propose an end-to-end trainable regression approach for human pose estimation from still images. We use the proposed Soft-argmax function to convert feature maps directly to joint coordinates, resulting in a fully differentiable framework. Our method is able to learn heat maps representations indirectly, without additional steps of artificial ground truth generation. Consequently, contextual information can be included to the pose predictions in a seamless way. We evaluated our method on two very challenging datasets, the Leeds Sports Poses (LSP) and the MPII Human Pose datasets, reaching the best performance among all the existing regression methods and comparable results to the state-of-the-art detection based approaches.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dluvizon/pose-regression
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.