POCO: 3D Pose and Shape Estimation with Confidence
Sai Kumar Dwivedi, Cordelia Schmid, Hongwei Yi, Michael J. Black,, Dimitrios Tzionas

TL;DR
POCO introduces a framework for 3D human pose and shape estimation that also predicts confidence levels, improving downstream task reliability and enabling semi-supervised learning and video inpainting.
Contribution
It presents a novel method to estimate both 3D pose and confidence simultaneously, applicable to existing regressors, enhancing accuracy and utility.
Findings
Confidence estimates correlate with pose quality.
Using confidence for pseudo-labeling improves training.
Uncertainty helps identify occluded frames for inpainting.
Abstract
The regression of 3D Human Pose and Shape (HPS) from an image is becoming increasingly accurate. This makes the results useful for downstream tasks like human action recognition or 3D graphics. Yet, no regressor is perfect, and accuracy can be affected by ambiguous image evidence or by poses and appearance that are unseen during training. Most current HPS regressors, however, do not report the confidence of their outputs, meaning that downstream tasks cannot differentiate accurate estimates from inaccurate ones. To address this, we develop POCO, a novel framework for training HPS regressors to estimate not only a 3D human body, but also their confidence, in a single feed-forward pass. Specifically, POCO estimates both the 3D body pose and a per-sample variance. The key idea is to introduce a Dual Conditioning Strategy (DCS) for regressing uncertainty that is highly correlated to pose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · 3D Shape Modeling and Analysis
MethodsInpainting
