Which Samples Should be Learned First: Easy or Hard?

Xiaoling Zhou; Ou Wu

arXiv:2110.05481·cs.LG·September 20, 2022

Which Samples Should be Learned First: Easy or Hard?

Xiaoling Zhou, Ou Wu

PDF

Open Access

TL;DR

This paper investigates whether easy, hard, or other samples should be learned first in training, providing theoretical analysis and experiments to determine optimal sample prioritization modes based on difficulty distribution.

Contribution

It introduces a unified framework with an optimized objective function and flexible mode switching to select the best sample learning order without prior knowledge.

Findings

01

The optimal sample learning order depends on difficulty distribution.

02

Four priority modes are identified: easy, medium, hard, and two-ends.

03

Flexible switching among modes improves training effectiveness.

Abstract

An effective weighting scheme for training samples is essential for learning tasks. Numerous weighting schemes have been proposed. Some schemes take the easy-first mode, whereas some others take the hard-first one. Naturally, an interesting yet realistic question is raised. Which samples should be learned first given a new learning task, easy or hard? To answer this question, both theoretical analyses and experimental verification are conducted. First, a general optimized objective function is proposed, revealing the relationship between the difficulty distribution and the difficulty-based sample weights. Second, on the basis of the optimized objective function, theoretical answers are obtained. Besides the easy-first and hard-first modes, there are two other priority modes, namely, medium-first and two-ends-first. The prior mode does not necessarily remain unchanged during the training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Domain Adaptation and Few-Shot Learning