Human pose estimation via Convolutional Part Heatmap Regression

Adrian Bulat; Georgios Tzimiropoulos

arXiv:1609.01743·cs.CV·August 28, 2018

Human pose estimation via Convolutional Part Heatmap Regression

Adrian Bulat, Georgios Tzimiropoulos

PDF

1 Repo

TL;DR

This paper introduces a CNN cascade architecture for human pose estimation that effectively handles occlusions by combining part detection heatmaps with regression, improving accuracy on standard datasets.

Contribution

A novel CNN cascade architecture that learns part relationships and spatial context, robustly inferring pose even with severe occlusions.

Findings

01

Achieves top performance on MPII dataset.

02

Effectively handles occlusions through heatmap-guided regression.

03

Flexible integration with various CNN architectures.

Abstract

This paper is on human pose estimation using Convolutional Neural Networks. Our main contribution is a CNN cascaded architecture specifically designed for learning part relationships and spatial context, and robustly inferring pose even for the case of severe part occlusions. To this end, we propose a detection-followed-by-regression CNN cascade. The first part of our cascade outputs part detection heatmaps and the second part performs regression on these heatmaps. The benefits of the proposed architecture are multi-fold: It guides the network where to focus in the image and effectively encodes part constraints and context. More importantly, it can effectively cope with occlusions because part detection heatmaps for occluded parts provide low confidence scores which subsequently guide the regression part of our network to rely on contextual information in order to predict the location…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

1adrianb/human-pose-estimation
torchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsAverage Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling