RareGAN: Generating Samples for Rare Classes
Zinan Lin, Hao Liang, Giulia Fanti, Vyas Sekar

TL;DR
RareGAN is a novel GAN framework that effectively synthesizes rare class samples in unlabeled datasets by combining semi-supervised learning, active label querying, and weighted loss functions, outperforming prior methods.
Contribution
It introduces RareGAN, integrating semi-supervised GANs, active learning, and weighted loss to improve rare class sample generation with limited labels.
Findings
Outperforms prior methods in fidelity and diversity of rare class samples.
Effective across various applications, budgets, and architectures.
Balances fidelity and diversity better than existing approaches.
Abstract
We study the problem of learning generative adversarial networks (GANs) for a rare class of an unlabeled dataset subject to a labeling budget. This problem is motivated from practical applications in domains including security (e.g., synthesizing packets for DNS amplification attacks), systems and networking (e.g., synthesizing workloads that trigger high resource usage), and machine learning (e.g., generating images from a rare class). Existing approaches are unsuitable, either requiring fully-labeled datasets or sacrificing the fidelity of the rare class for that of the common classes. We propose RareGAN, a novel synthesis of three key ideas: (1) extending conditional GANs to use labelled and unlabelled data for better generalization; (2) an active learning approach that requests the most useful labels; and (3) a weighted loss function to favor learning the rare class. We show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Algorithms · COVID-19 diagnosis using AI · Adversarial Robustness in Machine Learning
