Bias-Compensated Integral Regression for Human Pose Estimation
Kerui Gu, Linlin Yang, Michael Bi Mi, Angela Yao

TL;DR
This paper identifies a bias in integral regression for pose estimation and proposes Bias Compensated Integral Regression (BCIR), which improves accuracy and training speed, making it competitive with detection-based methods.
Contribution
The paper introduces BCIR, a novel framework that compensates for bias in integral regression and incorporates a Gaussian prior loss for better accuracy and faster training.
Findings
BCIR outperforms original integral regression in accuracy.
BCIR trains faster than traditional integral regression.
BCIR achieves competitive results with state-of-the-art detection methods.
Abstract
In human and hand pose estimation, heatmaps are a crucial intermediate representation for a body or hand keypoint. Two popular methods to decode the heatmap into a final joint coordinate are via an argmax, as done in heatmap detection, or via softmax and expectation, as done in integral regression. Integral regression is learnable end-to-end, but has lower accuracy than detection. This paper uncovers an induced bias from integral regression that results from combining the softmax and the expectation operation. This bias often forces the network to learn degenerately localized heatmaps, obscuring the keypoint's true underlying distribution and leads to lower accuracies. Training-wise, by investigating the gradients of integral regression, we show that the implicit guidance of integral regression to update the heatmap makes it slower to converge than detection. To counter the above two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Anomaly Detection Techniques and Applications
MethodsSoftmax · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Heatmap
