TL;DR
This paper introduces a lightweight super-resolution head for human pose estimation that reduces quantization errors by predicting high-resolution heatmaps, improving accuracy without complex post-processing.
Contribution
It reformulates heatmap prediction as a super-resolution task, enabling high-resolution heatmaps with minimal additional complexity, applicable to various pose estimation frameworks.
Findings
Outperforms existing heatmap-based methods on COCO, MPII, and CrowdPose datasets.
Effectively reduces quantization errors without heavy post-processing.
Lightweight SR head is versatile for top-down and bottom-up methods.
Abstract
Heatmap-based methods have become the mainstream method for pose estimation due to their superior performance. However, heatmap-based approaches suffer from significant quantization errors with downscale heatmaps, which result in limited performance and the detrimental effects of intermediate supervision. Previous heatmap-based methods relied heavily on additional post-processing to mitigate quantization errors. Some heatmap-based approaches improve the resolution of feature maps by using multiple costly upsampling layers to improve localization precision. To solve the above issues, we creatively view the backbone network as a degradation process and thus reformulate the heatmap prediction as a Super-Resolution (SR) task. We first propose the SR head, which predicts heatmaps with a spatial resolution higher than the input feature maps (or even consistent with the input image) by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsHeatmap
