TL;DR
PoseBH introduces a novel multi-dataset training framework for pose estimation that effectively handles skeletal heterogeneity and limited supervision, improving generalization across diverse datasets and transferring well to related tasks.
Contribution
It proposes nonparametric keypoint prototypes and a cross-type self-supervision mechanism to unify and enhance multi-dataset pose estimation beyond existing methods.
Findings
Improves generalization across multiple pose datasets.
Transfers effectively to hand and body shape estimation tasks.
Maintains performance on standard human pose benchmarks.
Abstract
We study multi-dataset training (MDT) for pose estimation, where skeletal heterogeneity presents a unique challenge that existing methods have yet to address. In traditional domains, \eg regression and classification, MDT typically relies on dataset merging or multi-head supervision. However, the diversity of skeleton types and limited cross-dataset supervision complicate integration in pose estimation. To address these challenges, we introduce PoseBH, a new MDT framework that tackles keypoint heterogeneity and limited supervision through two key techniques. First, we propose nonparametric keypoint prototypes that learn within a unified embedding space, enabling seamless integration across skeleton types. Second, we develop a cross-type self-supervision mechanism that aligns keypoint predictions with keypoint embedding prototypes, providing supervision without relying on teacher-student…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
