Sampling Strategies for GAN Synthetic Data

Binod Bhattarai; Seungryul Baek; Rumeysa Bodur; Tae-Kyun Kim

arXiv:1909.04689·cs.CV·June 18, 2020

Sampling Strategies for GAN Synthetic Data

Binod Bhattarai, Seungryul Baek, Rumeysa Bodur, Tae-Kyun Kim

PDF

TL;DR

This paper introduces novel sampling strategies for GAN-generated synthetic data, leveraging GAN training parameters and reinforcement learning to improve the quality and diversity of data used for training deep CNNs, leading to better performance.

Contribution

It presents simple, effective sampling methods that utilize GAN discriminator confidence and RL-based subset selection to enhance CNN training with synthetic data.

Findings

01

Sampling synthetic data improves CNN accuracy.

02

Using discriminator confidence enhances data quality.

03

Reinforcement learning effectively selects meaningful synthetic examples.

Abstract

Generative Adversarial Networks (GANs) have been used widely to generate large volumes of synthetic data. This data is being utilized for augmenting with real examples in order to train deep Convolutional Neural Networks (CNNs). Studies have shown that the generated examples lack sufficient realism to train deep CNNs and are poor in diversity. Unlike previous studies of randomly augmenting the synthetic data with real data, we present our simple, effective and easy to implement synthetic data sampling methods to train deep CNNs more efficiently and accurately. To this end, we propose to maximally utilize the parameters learned during training of the GAN itself. These include discriminator's realism confidence score and the confidence on the target label of the synthetic data. In addition to this, we explore reinforcement learning (RL) to automatically search a subset of meaningful…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConvolution · Dogecoin Customer Service Number +1-833-534-1729