Generating Human Action Videos by Coupling 3D Game Engines and Probabilistic Graphical Models
C\'esar Roberto de Souza, Adrien Gaidon, Yohann Cabon, Naila Murray,, Antonio Manuel L\'opez

TL;DR
This paper introduces PHAV, a large synthetic human action video dataset generated via a novel procedural and physics-based approach using game engines, which improves action recognition models when combined with real data.
Contribution
The paper presents a new interpretable generative model for creating diverse, realistic, and physically plausible human action videos, including synthetic categories, and a multi-modal data generation pipeline.
Findings
Synthetic data boosts recognition performance on benchmarks.
PHAV dataset contains nearly 40,000 videos across 35 categories.
Synthetic videos outperform fine-tuned unsupervised generative models.
Abstract
Deep video action recognition models have been highly successful in recent years but require large quantities of manually annotated data, which are expensive and laborious to obtain. In this work, we investigate the generation of synthetic training data for video action recognition, as synthetic data have been successfully used to supervise models for a variety of other computer vision tasks. We propose an interpretable parametric generative model of human action videos that relies on procedural generation, physics models and other components of modern game engines. With this model we generate a diverse, realistic, and physically plausible dataset of human action videos, called PHAV for "Procedural Human Action Videos". PHAV contains a total of 39,982 videos, with more than 1,000 examples for each of 35 action categories. Our video generation approach is not limited to existing motion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
