GANs for learning from very high class conditional noisy labels
Sandhya Tripathi, N Hemachandra

TL;DR
This paper introduces GAN-based methods to robustly learn binary classifiers from high rates of class-conditional label noise, outperforming existing approaches without needing noise rate estimation.
Contribution
Proposes novel GAN schemes using Wasserstein GANs and data representation changes to effectively handle very high class-conditional noise rates in binary classification.
Findings
Significant improvement over existing methods at high noise rates
Effective in high-dimensional, imbalanced, and real-world datasets
Consistent performance demonstrated across multiple datasets and noise levels
Abstract
We use Generative Adversarial Networks (GANs) to design a class conditional label noise (CCN) robust scheme for binary classification. It first generates a set of correctly labelled data points from noisy labelled data and 0.1% or 1% clean labels such that the generated and true (clean) labelled data distributions are close; generated labelled data is used to learn a good classifier. The mode collapse problem while generating correct feature-label pairs and the problem of skewed feature-label dimension ratio ( 784:1) are avoided by using Wasserstein GAN and a simple data representation change. Another WGAN with information-theoretic flavour on top of the new representation is also proposed. The major advantage of both schemes is their significant improvement over the existing ones in presence of very high CCN rates, without either estimating or cross-validating over the noise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Machine Learning and Data Classification · Multidisciplinary Science and Engineering Research
MethodsConvolution · Wasserstein GAN
