On the role of depth predictions for 3D human pose estimation
Alec Diaz-Arias, Mitchell Messmore, Dmitriy Shin, and Stephen Baek

TL;DR
This paper demonstrates that incorporating estimated depth information into 2D joint locations significantly improves 3D human pose estimation accuracy from monocular images, even with noisy depth data.
Contribution
The authors introduce a system that uses 2D joint locations and their estimated depths to predict 3D poses, highlighting the importance of depth cues in resolving depth ambiguity.
Findings
Significant correlation between predicted depth and 3D coordinates despite noise
State-of-the-art results on H3.6M validation set due to depth input
System operates in real-time and can be combined with existing detectors
Abstract
Following the successful application of deep convolutional neural networks to 2d human pose estimation, the next logical problem to solve is 3d human pose estimation from monocular images. While previous solutions have shown some success, they do not fully utilize the depth information from the 2d inputs. With the goal of addressing this depth ambiguity, we build a system that takes 2d joint locations as input along with their estimated depth value and predicts their 3d positions in camera coordinates. Given the inherent noise and inaccuracy from estimating depth maps from monocular images, we perform an extensive statistical analysis showing that given this noise there is still a statistically significant correlation between the predicted depth values and the third coordinate of camera coordinates. We further explain how the state-of-the-art results we achieve on the H3.6M validation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Diabetic Foot Ulcer Assessment and Management · Video Surveillance and Tracking Methods
