Learning Human Poses from Actions
Aditya Arun, C.V. Jawahar, M. Pawan Kumar

TL;DR
This paper introduces a semi-supervised learning approach for human pose estimation in images, leveraging a small set of fully labeled images and a larger set with only action labels to reduce annotation costs while maintaining accuracy.
Contribution
It proposes a probabilistic framework that jointly models pose uncertainty conditioned on actions and unconditioned pose predictions, improving efficiency and accuracy in pose estimation.
Findings
Outperforms baseline methods on MPII dataset
Reduces annotation costs without significant accuracy loss
Uses deep probabilistic networks for pose estimation
Abstract
We consider the task of learning to estimate human pose in still images. In order to avoid the high cost of full supervision, we propose to use a diverse data set, which consists of two types of annotations: (i) a small number of images are labeled using the expensive ground-truth pose; and (ii) other images are labeled using the inexpensive action label. As action information helps narrow down the pose of a human, we argue that this approach can help reduce the cost of training without significantly affecting the accuracy. To demonstrate this we design a probabilistic framework that employs two distributions: (i) a conditional distribution to model the uncertainty over the human pose given the image and the action; and (ii) a prediction distribution, which provides the pose of an image without using any action information. We jointly estimate the parameters of the two aforementioned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Advanced Vision and Imaging
