The Fisher-Rao Loss for Learning under Label Noise

Henrique K. Miyamoto; F\'abio C. C. Meneghetti; Sueli I. R. Costa

arXiv:2210.16401·cs.LG·May 15, 2024

The Fisher-Rao Loss for Learning under Label Noise

Henrique K. Miyamoto, F\'abio C. C. Meneghetti, Sueli I. R. Costa

PDF

Open Access

TL;DR

This paper investigates the Fisher-Rao loss function's robustness to label noise, deriving bounds and analyzing its learning dynamics, supported by experiments on synthetic and MNIST datasets.

Contribution

It introduces the Fisher-Rao loss for label noise robustness, providing theoretical bounds and analyzing its training behavior compared to other losses.

Findings

01

Fisher-Rao loss offers a good trade-off between robustness and training speed.

02

Theoretical upper bounds on performance degradation are established.

03

Numerical experiments demonstrate its effectiveness on synthetic and MNIST datasets.

Abstract

Choosing a suitable loss function is essential when learning by empirical risk minimisation. In many practical cases, the datasets used for training a classifier may contain incorrect labels, which prompts the interest for using loss functions that are inherently robust to label noise. In this paper, we study the Fisher-Rao loss function, which emerges from the Fisher-Rao distance in the statistical manifold of discrete distributions. We derive an upper bound for the performance degradation in the presence of label noise, and analyse the learning speed of this loss. Comparing with other commonly used losses, we argue that the Fisher-Rao loss provides a natural trade-off between robustness and training dynamics. Numerical experiments with synthetic and MNIST datasets illustrate this performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Machine Learning and Data Classification · Advanced Statistical Methods and Models

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings