HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation
Bowen Cheng, Bin Xiao, Jingdong Wang, Honghui Shi, Thomas S. Huang,, and Lei Zhang

TL;DR
HigherHRNet introduces a scale-aware, multi-resolution approach for bottom-up human pose estimation, significantly improving accuracy for small persons and crowded scenes by leveraging high-resolution feature pyramids.
Contribution
It proposes a novel scale-aware representation learning method with multi-resolution supervision and aggregation, advancing bottom-up pose estimation accuracy and robustness.
Findings
Outperforms previous methods by 2.5% AP on COCO for medium persons
Achieves 70.5% AP on COCO test-dev without post-processing
Surpasses top-down methods on CrowdPose test with 67.6% AP
Abstract
Bottom-up human pose estimation methods have difficulties in predicting the correct pose for small persons due to challenges in scale variation. In this paper, we present HigherHRNet: a novel bottom-up human pose estimation method for learning scale-aware representations using high-resolution feature pyramids. Equipped with multi-resolution supervision for training and multi-resolution aggregation for inference, the proposed approach is able to solve the scale variation challenge in bottom-up multi-person pose estimation and localize keypoints more precisely, especially for small person. The feature pyramid in HigherHRNet consists of feature map outputs from HRNet and upsampled higher-resolution outputs through a transposed convolution. HigherHRNet outperforms the previous best bottom-up method by 2.5% AP for medium person on COCO test-dev, showing its effectiveness in handling scale…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation· youtube
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation· youtube
Taxonomy
TopicsHuman Pose and Action Recognition · Anomaly Detection Techniques and Applications · Hand Gesture Recognition Systems
MethodsResidual Connection · Convolution · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · HRNet
