OriNet: A Fully Convolutional Network for 3D Human Pose Estimation
Chenxu Luo, Xiao Chu, Alan Yuille

TL;DR
This paper introduces OriNet, a fully convolutional network that estimates 3D human poses from monocular images using limb orientations and bounding boxes, achieving strong results and good generalization.
Contribution
The paper presents a novel approach combining limb orientations with bounding boxes for 3D pose estimation, improving accuracy and robustness without extra constraints.
Findings
Achieves good results on large-scale benchmarks.
Generalizes well to new scenes.
Robust to bounding box inaccuracies.
Abstract
In this paper, we propose a fully convolutional network for 3D human pose estimation from monocular images. We use limb orientations as a new way to represent 3D poses and bind the orientation together with the bounding box of each limb region to better associate images and predictions. The 3D orientations are modeled jointly with 2D keypoint detections. Without additional constraints, this simple method can achieve good results on several large-scale benchmarks. Further experiments show that our method can generalize well to novel scenes and is robust to inaccurate bounding boxes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Diabetic Foot Ulcer Assessment and Management · Gait Recognition and Analysis
