Automatic Dataset Augmentation Using Virtual Human Simulation
Marcelo C. Ghilardi, Leandro Dihl, Estev\~ao Testa, Pedro Braga,, Jo\~ao P. Pianta, Isabel H. Manssour, Soraia R. Musse

TL;DR
This paper explores using simple-rendered virtual human images to augment training datasets for pedestrian detection, showing that synthetic data can improve machine learning performance in this task.
Contribution
It introduces a data-driven parametrization of virtual humans and demonstrates that CG images can effectively enhance training datasets for pedestrian detection.
Findings
CG images outperform real images in training datasets
Automatic ground truth generation for CG images simplifies annotation
Synthetic data improves pedestrian detection accuracy
Abstract
Virtual Human Simulation has been widely used for different purposes, such as comfort or accessibility analysis. In this paper, we investigate the possibility of using this type of technique to extend the training datasets of pedestrians to be used with machine learning techniques. Our main goal is to verify if Computer Graphics (CG) images of virtual humans with a simplistic rendering can be efficient in order to augment datasets used for training machine learning methods. In fact, from a machine learning point of view, there is a need to collect and label large datasets for ground truth, which sometimes demands manual annotation. In addition, find out images and videos with real people and also provide ground truth of people detection and counting is not trivial. If CG images, which can have a ground truth automatically generated, can also be used as training in machine learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition
