On The Power of Curriculum Learning in Training Deep Networks
Guy Hacohen, Daphna Weinshall

TL;DR
This paper investigates how curriculum learning, which involves non-uniform data sampling based on difficulty, can accelerate training and improve the performance of deep neural networks, especially CNNs for image recognition.
Contribution
It introduces methods for sorting training examples by difficulty and guides sampling with pacing functions, supported by empirical and theoretical analysis.
Findings
Curriculum learning increases training speed.
It improves final test performance.
Theoretical analysis shows it modifies the optimization landscape.
Abstract
Training neural networks is traditionally done by providing a sequence of random mini-batches sampled uniformly from the entire training data. In this work, we analyze the effect of curriculum learning, which involves the non-uniform sampling of mini-batches, on the training of deep networks, and specifically CNNs trained for image recognition. To employ curriculum learning, the training algorithm must resolve 2 problems: (i) sort the training examples by difficulty; (ii) compute a series of mini-batches that exhibit an increasing level of difficulty. We address challenge (i) using two methods: transfer learning from some competitive ``teacher" network, and bootstrapping. In our empirical evaluation, both methods show similar benefits in terms of increased learning speed and improved final performance on test data. We address challenge (ii) by investigating different pacing functions to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Machine Learning and Data Classification · Machine Learning and Algorithms
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
