TL;DR
AGORA is a highly realistic synthetic dataset with detailed ground truth for 3D human pose estimation, including children, designed to bridge the gap between current datasets and real-world scenarios.
Contribution
The paper introduces AGORA, a new synthetic dataset with high realism, detailed annotations including face and hands, and diverse human subjects, especially children, to improve 3D pose estimation methods.
Findings
Existing methods perform poorly on children in AGORA.
Fine-tuning on AGORA improves performance on AGORA and 3DPW.
Extending SMPL-X improves modeling of children's body shapes.
Abstract
While the accuracy of 3D human pose estimation from images has steadily improved on benchmark datasets, the best methods still fail in many real-world scenarios. This suggests that there is a domain gap between current datasets and common scenes containing people. To obtain ground-truth 3D pose, current datasets limit the complexity of clothing, environmental conditions, number of subjects, and occlusion. Moreover, current datasets evaluate sparse 3D joint locations corresponding to the major joints of the body, ignoring the hand pose and the face shape. To evaluate the current state-of-the-art methods on more challenging images, and to drive the field to address new problems, we introduce AGORA, a synthetic dataset with high realism and highly accurate ground truth. Here we use 4240 commercially-available, high-quality, textured human scans in diverse poses and natural clothing; this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
