RSPose: Ranking Based Losses for Human Pose Estimation
Muhammed Can Keles, Bedrettin Cetinkaya, Sinan Kalkan, Emre Akbas

TL;DR
This paper introduces ranking-based loss functions for human pose estimation that better align with evaluation metrics, improve confidence score correlation, and outperform previous methods on multiple datasets.
Contribution
The authors propose novel ranking-based loss functions specifically designed for heatmap-based human pose estimation, addressing limitations of traditional MSE and KL losses.
Findings
RSPose improves mAP scores on COCO, CrowdPose, and MPII datasets.
The proposed losses increase correlation between confidence scores and localization quality.
RSPose achieves state-of-the-art performance on COCO-val with ViTPose-H.
Abstract
While heatmap-based human pose estimation methods have shown strong performance, they suffer from three main problems: (P1) "Commonly used Mean Squared Error (MSE)" Loss may not always improve joint localization because it penalizes all pixel deviations equally, without focusing explicitly on sharpening and correctly localizing the peak corresponding to the joint; (P2) heatmaps are spatially and class-wise imbalanced; and, (P3) there is a discrepancy between the evaluation metric (i.e., mAP) and the loss functions. We propose ranking-based losses to address these issues. Both theoretically and empirically, we show that our proposed losses are superior to commonly used heatmap losses (MSE, KL-Divergence). Our losses considerably increase the correlation between confidence scores and localization qualities, which is desirable because higher correlation leads to more accurate instance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Robot Manipulation and Learning
