Identifying Hard Noise in Long-Tailed Sample Distribution
Xuanyu Yi, Kaihua Tang, Xian-Sheng Hua, Joo-Hwee Lim, Hanwang Zhang

TL;DR
This paper addresses the challenge of identifying and removing hard noises in long-tailed data distributions, proposing a novel iterative framework that improves de-noising performance in imbalanced datasets.
Contribution
The paper introduces the Hard-to-Easy (H2E) framework, a new iterative method that effectively identifies hard noises in long-tailed data, outperforming existing de-noising techniques.
Findings
H2E outperforms state-of-the-art de-noising methods on long-tailed benchmarks.
H2E maintains stable performance on balanced datasets.
Hard noises are effectively reduced to easy noises through the iterative process.
Abstract
Conventional de-noising methods rely on the assumption that all samples are independent and identically distributed, so the resultant classifier, though disturbed by noise, can still easily identify the noises as the outliers of training distribution. However, the assumption is unrealistic in large-scale data that is inevitably long-tailed. Such imbalanced training data makes a classifier less discriminative for the tail classes, whose previously "easy" noises are now turned into "hard" ones -- they are almost as outliers as the clean tail samples. We introduce this new challenge as Noisy Long-Tailed Classification (NLT). Not surprisingly, we find that most de-noising methods fail to identify the hard noises, resulting in significant performance drop on the three proposed NLT benchmarks: ImageNet-NLT, Animal10-NLT, and Food101-NLT. To this end, we design an iterative noisy learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Image and Signal Denoising Methods · Image Processing Techniques and Applications
