Correlated Input-Dependent Label Noise in Large-Scale Image   Classification

Mark Collier; Basil Mustafa; Efi Kokiopoulou; Rodolphe Jenatton and; Jesse Berent

arXiv:2105.10305·cs.LG·May 24, 2021

Correlated Input-Dependent Label Noise in Large-Scale Image Classification

Mark Collier, Basil Mustafa, Efi Kokiopoulou, Rodolphe Jenatton and, Jesse Berent

PDF

TL;DR

This paper introduces a probabilistic model for input-dependent label noise in large-scale image classification, improving accuracy by capturing heteroscedastic uncertainty with a learned covariance structure.

Contribution

It proposes a novel multivariate Normal latent variable approach to model label noise, enhancing classification performance on multiple large datasets.

Findings

01

Achieved state-of-the-art accuracy on WebVision 1.0.

02

Significantly improved results on ImageNet and JFT datasets.

03

Learned covariance structures align with known noise sources.

Abstract

Large scale image classification datasets often contain noisy labels. We take a principled probabilistic approach to modelling input-dependent, also known as heteroscedastic, label noise in these datasets. We place a multivariate Normal distributed latent variable on the final hidden layer of a neural network classifier. The covariance matrix of this latent variable, models the aleatoric uncertainty due to label noise. We demonstrate that the learned covariance structure captures known sources of label noise between semantically similar and co-occurring classes. Compared to standard neural network training and other baselines, we show significantly improved accuracy on Imagenet ILSVRC 2012 79.3% (+2.6%), Imagenet-21k 47.0% (+1.1%) and JFT 64.7% (+1.6%). We set a new state-of-the-art result on WebVision 1.0 with 76.6% top-1 accuracy. These datasets range from over 1M to over 300M…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.