TL;DR
This paper introduces a relative information encoding approach for 3D human pose estimation that improves robustness to global motion and enhances local motion prediction by utilizing relative coordinates and temporal connections.
Contribution
It proposes a novel relative information encoding method combined with multi-stage optimization to improve 3D human pose estimation accuracy and robustness.
Findings
Outperforms state-of-the-art methods on two public datasets.
Enhances prediction of local motion with small movement ranges.
Increases robustness to global motion interference.
Abstract
Most of the existing 3D human pose estimation approaches mainly focus on predicting 3D positional relationships between the root joint and other human joints (local motion) instead of the overall trajectory of the human body (global motion). Despite the great progress achieved by these approaches, they are not robust to global motion, and lack the ability to accurately predict local motion with a small movement range. To alleviate these two problems, we propose a relative information encoding method that yields positional and temporal enhanced representations. Firstly, we encode positional information by utilizing relative coordinates of 2D poses to enhance the consistency between the input and output distribution. The same posture with different absolute 2D positions can be mapped to a common representation. It is beneficial to resist the interference of global motion on the prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
