Learning Feature Pyramids for Human Pose Estimation
Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, Xiaogang Wang

TL;DR
This paper introduces a novel Pyramid Residual Module (PRMs) to improve scale invariance in deep convolutional neural networks for human pose estimation, achieving state-of-the-art results on standard benchmarks.
Contribution
We propose a new Pyramid Residual Module and extend weight initialization schemes for multi-branch networks, enhancing scale invariance and performance in pose estimation.
Findings
Achieves state-of-the-art results on pose estimation benchmarks.
Demonstrates the effectiveness of PRMs in handling scale variations.
Provides theoretical extension for weight initialization in multi-branch networks.
Abstract
Articulated human pose estimation is a fundamental yet challenging task in computer vision. The difficulty is particularly pronounced in scale variations of human body parts when camera view changes or severe foreshortening happens. Although pyramid methods are widely used to handle scale changes at inference time, learning feature pyramids in deep convolutional neural networks (DCNNs) is still not well explored. In this work, we design a Pyramid Residual Module (PRMs) to enhance the invariance in scales of DCNNs. Given input features, the PRMs learn convolutional filters on various scales of input features, which are obtained with different subsampling ratios in a multi-branch network. Moreover, we observe that it is inappropriate to adopt existing methods to initialize the weights of multi-branch networks, which achieve superior performance than plain networks in many tasks recently.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Hand Gesture Recognition Systems
