Human Action Recognition in Drone Videos using a Few Aerial Training Examples
Waqas Sultani, Mubarak Shah

TL;DR
This paper introduces a novel approach for aerial human action recognition in drone videos by leveraging synthetic data from video games and GAN-generated features, combined with a disjoint multitask learning framework to improve classification with limited real data.
Contribution
The paper proposes integrating game-based and GAN-generated aerial data using disjoint multitask learning to enhance action recognition with few real aerial training samples.
Findings
Game and GAN data improve recognition accuracy.
Disjoint multitask learning effectively combines heterogeneous data.
Approach outperforms baseline methods on two datasets.
Abstract
Drones are enabling new forms of human actions surveillance due to their low cost and fast mobility. However, using deep neural networks for automatic aerial action recognition is difficult due to the need for a large number of training aerial human action videos. Collecting a large number of human action aerial videos is costly, time-consuming, and difficult. In this paper, we explore two alternative data sources to improve aerial action classification when only a few training aerial examples are available. As a first data source, we resort to video games. We collect plenty of aerial game action videos using two gaming engines. For the second data source, we leverage conditional Wasserstein Generative Adversarial Networks to generate aerial features from ground videos. Given that both data sources have some limitations, e.g. game videos are biased towards specific actions categories…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
