TL;DR
This paper introduces Residual Log-likelihood Estimation (RLE), a novel regression-based approach for human pose estimation that models output distribution more accurately, outperforming traditional heatmap methods in accuracy and efficiency.
Contribution
The paper proposes RLE, a new regression paradigm that captures output distribution changes, compatible with flow models, improving pose estimation performance significantly.
Findings
12.4 mAP improvement on MSCOCO without test-time overhead
Outperforms heatmap-based methods in multi-person pose estimation
Effective and flexible regression method for human pose tasks
Abstract
Heatmap-based methods dominate in the field of human pose estimation by modelling the output distribution through likelihood heatmaps. In contrast, regression-based methods are more efficient but suffer from inferior performance. In this work, we explore maximum likelihood estimation (MLE) to develop an efficient and effective regression-based methods. From the perspective of MLE, adopting different regression losses is making different assumptions about the output density function. A density function closer to the true distribution leads to a better regression performance. In light of this, we propose a novel regression paradigm with Residual Log-likelihood Estimation (RLE) to capture the underlying output distribution. Concretely, RLE learns the change of the distribution instead of the unreferenced underlying distribution to facilitate the training process. With the proposed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
