Heterogeneous Multi-task Learning for Human Pose Estimation with Deep   Convolutional Neural Network

Sijin Li; Zhi-Qiang Liu; Antoni B. Chan

arXiv:1406.3474·cs.CV·June 16, 2014·46 cites

Heterogeneous Multi-task Learning for Human Pose Estimation with Deep Convolutional Neural Network

Sijin Li, Zhi-Qiang Liu, Antoni B. Chan

PDF

Open Access

TL;DR

This paper introduces a multi-task deep learning framework for human pose estimation that jointly learns pose regression and body-part detection, improving accuracy and interpretability.

Contribution

It presents a novel heterogeneous multi-task learning approach combining pose estimation and body-part detection within a deep CNN architecture.

Findings

01

Achieves state-of-the-art results on multiple datasets.

02

Including body-part detection regularizes the network effectively.

03

Middle-layer neurons are tuned to specific body parts.

Abstract

We propose an heterogeneous multi-task learning framework for human pose estimation from monocular image with deep convolutional neural network. In particular, we simultaneously learn a pose-joint regressor and a sliding-window body-part detector in a deep network architecture. We show that including the body-part detection task helps to regularize the network, directing it to converge to a good solution. We report competitive and state-of-art results on several data sets. We also empirically show that the learned neurons in the middle layer of our network are tuned to localized body parts.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Video Surveillance and Tracking Methods