SynPlay: Large-Scale Synthetic Human Data with Real-World Diversity for Aerial-View Perception

Jinsub Yim; Hyungtae Lee; Sungmin Eum; Yi-Ting Shen; Yan Zhang; Heesung Kwon; Shuvra S. Bhattacharyya

arXiv:2408.11814·cs.CV·December 2, 2025

SynPlay: Large-Scale Synthetic Human Data with Real-World Diversity for Aerial-View Perception

Jinsub Yim, Hyungtae Lee, Sungmin Eum, Yi-Ting Shen, Yan Zhang, Heesung Kwon, Shuvra S. Bhattacharyya

PDF

Open Access

TL;DR

SynPlay is a large-scale synthetic dataset designed for aerial-view human perception, featuring diverse behaviors and multi-camera perspectives to improve localization models in long-range, data-scarce scenarios.

Contribution

It introduces a novel rule-guided motion generation framework and a multi-camera setup, capturing diverse, spontaneous human behaviors from aerial viewpoints for the first time.

Findings

01

Training with SynPlay improves localization accuracy in few-shot scenarios.

02

The dataset enables models to better handle long-range and small-scale human detection.

03

SynPlay's diversity enhances generalization in aerial human perception tasks.

Abstract

We introduce SynPlay, a large-scale synthetic human dataset purpose-built for advancing multi-perspective human localization, with a predominant focus on aerial-view perception. SynPlay departs from traditional synthetic datasets by addressing a critical but underexplored challenge: localizing humans in aerial scenes where subjects often occupy only tens of pixels in the image. In such scenarios, fine-grained details like facial features or textures become irrelevant, shifting the burden of recognition to human motion, behavior, and interactions. To meet this need, SynPlay implements a novel rule-guided motion generation framework that combines real-world motion capture with motion evolution graphs. This design enables human actions to evolve dynamically through high-level game rules rather than predefined scripts, resulting in effectively uncountable motion variations. Unlike existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsContext-Aware Activity Recognition Systems

MethodsFocus