DIY Human Action Data Set Generation
Mehran Khodabandeh, Hamid Reza Vaezi Joze, Ilya Zharkov, Vivek, Pradeep

TL;DR
This paper presents a novel method for generating extensive human action video datasets from small initial sets by partitioning, manipulating, and reassembling video components, significantly aiding deep learning training.
Contribution
It introduces a new approach to decompose and reassemble action videos, along with a skeleton trajectory generation method, to create unlimited training data for human action recognition.
Findings
Generated larger datasets improved neural network performance
Method reduced need for costly data collection
Effective on small existing datasets
Abstract
The recent successes in applying deep learning techniques to solve standard computer vision problems has aspired researchers to propose new computer vision problems in different domains. As previously established in the field, training data itself plays a significant role in the machine learning process, especially deep learning approaches which are data hungry. In order to solve each new problem and get a decent performance, a large amount of data needs to be captured which may in many cases pose logistical difficulties. Therefore, the ability to generate de novo data or expand an existing data set, however small, in order to satisfy data requirement of current networks may be invaluable. Herein, we introduce a novel way to partition an action video clip into action, subject and context. Each part is manipulated separately and reassembled with our proposed video generation technique.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
