Generalized Jensen-Shannon Divergence Loss for Learning with Noisy   Labels

Erik Englesson; Hossein Azizpour

arXiv:2105.04522·cs.LG·November 1, 2021·22 cites

Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels

Erik Englesson, Hossein Azizpour

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a generalized Jensen-Shannon divergence loss that interpolates between cross entropy and mean absolute error, improving learning robustness with noisy labels and achieving state-of-the-art results on noisy datasets.

Contribution

It proposes a novel generalized Jensen-Shannon divergence loss that enhances robustness to noisy labels by encouraging consistency around data points.

Findings

01

Achieves state-of-the-art results on CIFAR with synthetic noise.

02

Outperforms existing methods on WebVision with real-world noise.

03

Demonstrates improved robustness across varying noise rates.

Abstract

Prior works have found it beneficial to combine provably noise-robust loss functions e.g., mean absolute error (MAE) with standard categorical loss function e.g. cross entropy (CE) to improve their learnability. Here, we propose to use Jensen-Shannon divergence as a noise-robust loss function and show that it interestingly interpolate between CE and MAE with a controllable mixing parameter. Furthermore, we make a crucial observation that CE exhibit lower consistency around noisy data points. Based on this observation, we adopt a generalized version of the Jensen-Shannon divergence for multiple distributions to encourage consistency around data points. Using this loss function, we show state-of-the-art results on both synthetic (CIFAR), and real-world (e.g., WebVision) noise with varying noise rates.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

erikenglesson/gjs
pytorchOfficial

Videos

Generalized Jensen-Shannon Divergence Loss for Learning with Noisy Labels· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Statistical Methods and Models · Imbalanced Data Classification Techniques