GANs for learning from very high class conditional noisy labels

Sandhya Tripathi; N Hemachandra

arXiv:2010.09577·cs.LG·October 20, 2020

GANs for learning from very high class conditional noisy labels

Sandhya Tripathi, N Hemachandra

PDF

Open Access

TL;DR

This paper introduces GAN-based methods to robustly learn binary classifiers from high rates of class-conditional label noise, outperforming existing approaches without needing noise rate estimation.

Contribution

Proposes novel GAN schemes using Wasserstein GANs and data representation changes to effectively handle very high class-conditional noise rates in binary classification.

Findings

01

Significant improvement over existing methods at high noise rates

02

Effective in high-dimensional, imbalanced, and real-world datasets

03

Consistent performance demonstrated across multiple datasets and noise levels

Abstract

We use Generative Adversarial Networks (GANs) to design a class conditional label noise (CCN) robust scheme for binary classification. It first generates a set of correctly labelled data points from noisy labelled data and 0.1% or 1% clean labels such that the generated and true (clean) labelled data distributions are close; generated labelled data is used to learn a good classifier. The mode collapse problem while generating correct feature-label pairs and the problem of skewed feature-label dimension ratio ( $\sim$ 784:1) are avoided by using Wasserstein GAN and a simple data representation change. Another WGAN with information-theoretic flavour on top of the new representation is also proposed. The major advantage of both schemes is their significant improvement over the existing ones in presence of very high CCN rates, without either estimating or cross-validating over the noise…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Machine Learning and Data Classification · Multidisciplinary Science and Engineering Research

MethodsConvolution · Wasserstein GAN