TL;DR
This paper introduces metric-scale truncation-robust heatmaps for absolute 3D human pose estimation, enabling direct, complete, and scale-aware pose predictions without external scale information or heuristics.
Contribution
The authors propose a novel 3D heatmap representation in metric space and a combined architecture that improves absolute pose estimation accuracy.
Findings
Achieves state-of-the-art results on Human3.6M, MPI-INF-3DHP, and MuPoTS-3D datasets.
Enables direct metric-scale pose estimation without external scale cues.
Improves localization of truncated or partially visible body parts.
Abstract
Heatmap representations have formed the basis of human pose estimation systems for many years, and their extension to 3D has been a fruitful line of recent research. This includes 2.5D volumetric heatmaps, whose X and Y axes correspond to image space and Z to metric depth around the subject. To obtain metric-scale predictions, 2.5D methods need a separate post-processing step to resolve scale ambiguity. Further, they cannot localize body joints outside the image boundaries, leading to incomplete estimates for truncated images. To address these limitations, we propose metric-scale truncation-robust (MeTRo) volumetric heatmaps, whose dimensions are all defined in metric 3D space, instead of being aligned with image space. This reinterpretation of heatmap dimensions allows us to directly estimate complete, metric-scale poses without test-time knowledge of distance or relying on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsHeatmap
