VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual   Data

Jiajun Su; Chunyu Wang; Xiaoxuan Ma; Wenjun Zeng; and Yizhou Wang

arXiv:2207.09949·cs.CV·July 21, 2022·1 cites

VirtualPose: Learning Generalizable 3D Human Pose Models from Virtual Data

Jiajun Su, Chunyu Wang, Xiaoxuan Ma, Wenjun Zeng, and Yizhou Wang

PDF

Open Access 1 Repo

TL;DR

VirtualPose introduces a two-stage framework that leverages virtual data generation to improve the generalization of 3D human pose estimation across different cameras, poses, and appearances, without relying on paired real data.

Contribution

The paper proposes VirtualPose, a novel two-stage learning framework that exploits virtual data generation to enhance generalization in 3D human pose estimation.

Findings

01

Outperforms state-of-the-art methods without using paired real data.

02

Reduces overfitting by training on diverse 2D datasets and virtual AGR.

03

Improves robustness across different cameras, poses, and appearances.

Abstract

While monocular 3D pose estimation seems to have achieved very accurate results on the public datasets, their generalization ability is largely overlooked. In this work, we perform a systematic evaluation of the existing methods and find that they get notably larger errors when tested on different cameras, human poses and appearance. To address the problem, we introduce VirtualPose, a two-stage learning framework to exploit the hidden "free lunch" specific to this task, i.e. generating infinite number of poses and cameras for training models at no cost. To that end, the first stage transforms images to abstract geometry representations (AGR), and then the second maps them to 3D poses. It addresses the generalization issue from two aspects: (1) the first stage can be trained on diverse 2D datasets to reduce the risk of over-fitting to limited appearance; (2) the second stage can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wkom/virtualpose
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Video Surveillance and Tracking Methods