Cascaded deep monocular 3D human pose estimation with evolutionary   training data

Shichao Li; Lei Ke; Kevin Pratama; Yu-Wing Tai; Chi-Keung Tang,; Kwang-Ting Cheng

arXiv:2006.07778·cs.CV·April 12, 2021

Cascaded deep monocular 3D human pose estimation with evolutionary training data

Shichao Li, Lei Ke, Kevin Pratama, Yu-Wing Tai, Chi-Keung Tang,, Kwang-Ting Cheng

PDF

1 Repo 1 Video

TL;DR

This paper introduces a scalable data augmentation method for monocular 3D human pose estimation that synthesizes vast amounts of training data, improving accuracy and generalization to unseen poses.

Contribution

The proposed hierarchical evolution-based data augmentation method effectively generates diverse training data, reducing dataset bias and enhancing model generalization for 3D human pose estimation.

Findings

01

Achieves state-of-the-art accuracy on large benchmark datasets.

02

Significantly improves generalization to unseen and rare poses.

03

Synthesizes over 8 million valid 3D poses for training.

Abstract

End-to-end deep representation learning has achieved remarkable accuracy for monocular 3D human pose estimation, yet these models may fail for unseen poses with limited and fixed training data. This paper proposes a novel data augmentation method that: (1) is scalable for synthesizing massive amount of training data (over 8 million valid 3D human poses with corresponding 2D projections) for training 2D-to-3D networks, (2) can effectively reduce dataset bias. Our method evolves a limited dataset to synthesize unseen 3D human skeletons based on a hierarchical human representation and heuristics inspired by prior knowledge. Extensive experiments show that our approach not only achieves state-of-the-art accuracy on the largest public benchmark, but also generalizes significantly better to unseen and rare poses. Code, pre-trained models and tools are available at this HTTPS URL.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Nicholasli1995/EvoSkeleton
pytorchOfficial

Videos

Cascaded Deep Monocular 3D Human Pose Estimation With Evolutionary Training Data· youtube