Deep Pose Consensus Networks
Geonho Cha, Minsik Lee, Jungchan Cho, Songhwai Oh

TL;DR
This paper introduces a multi-part hypothesis framework for 3D human pose estimation from a single image, improving accuracy by combining multiple joint group estimations through robust optimization.
Contribution
It proposes a novel multi-part hypothesis approach with a sampling scheme and end-to-end training, outperforming existing methods on benchmark datasets.
Findings
Achieved state-of-the-art results on Human3.6M and HumanEva datasets.
Demonstrated the effectiveness of multi-part hypotheses over single estimators.
Enhanced robustness and accuracy in 3D pose estimation.
Abstract
In this paper, we address the problem of estimating a 3D human pose from a single image, which is important but difficult to solve due to many reasons, such as self-occlusions, wild appearance changes, and inherent ambiguities of 3D estimation from a 2D cue. These difficulties make the problem ill-posed, which have become requiring increasingly complex estimators to enhance the performance. On the other hand, most existing methods try to handle this problem based on a single complex estimator, which might not be good solutions. In this paper, to resolve this issue, we propose a multiple-partial-hypothesis-based framework for the problem of estimating 3D human pose from a single image, which can be fine-tuned in an end-to-end fashion. We first select several joint groups from a human joint model using the proposed sampling scheme, and estimate the 3D poses of each joint group separately…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Vision and Imaging
