Label-similarity Curriculum Learning

Urun Dogan; Aniket Anand Deshmukh; Marcin Machura; Christian Igel

arXiv:1911.06902·cs.CV·July 24, 2020

Label-similarity Curriculum Learning

Urun Dogan, Aniket Anand Deshmukh, Marcin Machura, Christian Igel

PDF

TL;DR

This paper introduces a curriculum learning method that gradually shifts label representations towards one-hot encoding based on class similarity, improving neural network training for image classification.

Contribution

It proposes a novel label-similarity curriculum learning approach that adapts loss functions by using class similarity, including a method to automatically compute similarities from word embeddings.

Findings

01

LCL improves classification accuracy across multiple datasets.

02

LCL outperforms standard training methods.

03

The approach is effective with various deep learning architectures.

Abstract

Curriculum learning can improve neural network training by guiding the optimization to desirable optima. We propose a novel curriculum learning approach for image classification that adapts the loss function by changing the label representation. The idea is to use a probability distribution over classes as target label, where the class probabilities reflect the similarity to the true class. Gradually, this label representation is shifted towards the standard one-hot-encoding. That is, in the beginning minor mistakes are corrected less than large mistakes, resembling a teaching process in which broad concepts are explained first before subtle differences are taught. The class similarity can be based on prior knowledge. For the special case of the labels being natural words, we propose a generic way to automatically compute the similarities. The natural words are embedded into Euclidean…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTest