ParGANDA: Making Synthetic Pedestrians A Reality For Object Detection

Daria Reshetova; Guanhang Wu; Marcel Puyat; Chunhui Gu; Huizhong Chen

arXiv:2307.11360·cs.CV·July 24, 2023

ParGANDA: Making Synthetic Pedestrians A Reality For Object Detection

Daria Reshetova, Guanhang Wu, Marcel Puyat, Chunhui Gu, Huizhong Chen

PDF

Open Access

TL;DR

This paper introduces ParGANDA, a method that uses GAN-based image translation to improve synthetic pedestrian data realism for object detection, reducing the domain gap without needing real data labels.

Contribution

The paper presents a novel GAN-based approach for unpaired image translation that maintains annotation accuracy, enhancing synthetic data utility for pedestrian detection tasks.

Findings

01

Improved detection performance using synthetic data with GAN translation.

02

Generated images are visually plausible and domain-adapted.

03

Method eliminates the need for real data labels in training.

Abstract

Object detection is the key technique to a number of Computer Vision applications, but it often requires large amounts of annotated data to achieve decent results. Moreover, for pedestrian detection specifically, the collected data might contain some personally identifiable information (PII), which is highly restricted in many countries. This label intensive and privacy concerning task has recently led to an increasing interest in training the detection models using synthetically generated pedestrian datasets collected with a photo-realistic video game engine. The engine is able to generate unlimited amounts of data with precise and consistent annotations, which gives potential for significant gains in the real-world applications. However, the use of synthetic data for training introduces a synthetic-to-real domain shift aggravating the final performance. To close the gap between the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Human Pose and Action Recognition