TL;DR
This paper introduces cyclical learning rates, a method that varies the learning rate cyclically during training, improving accuracy and reducing tuning effort across various neural network architectures and datasets.
Contribution
The paper proposes a novel cyclical learning rate schedule that eliminates the need for extensive hyper-parameter tuning and demonstrates its effectiveness on multiple datasets and architectures.
Findings
Improved classification accuracy with cyclical learning rates.
Reduced number of training iterations needed.
Effective across diverse neural network architectures.
Abstract
It is known that the learning rate is the most important hyper-parameter to tune for training deep neural networks. This paper describes a new method for setting the learning rate, named cyclical learning rates, which practically eliminates the need to experimentally find the best values and schedule for the global learning rates. Instead of monotonically decreasing the learning rate, this method lets the learning rate cyclically vary between reasonable boundary values. Training with cyclical learning rates instead of fixed values achieves improved classification accuracy without a need to tune and often in fewer iterations. This paper also describes a simple way to estimate "reasonable bounds" -- linearly increasing the learning rate of the network for a few epochs. In addition, cyclical learning rates are demonstrated on the CIFAR-10 and CIFAR-100 datasets with ResNets, Stochastic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsNEW HAMPSHIRE +256777182862 Love spells caster, voodoo spells IN NEW HAMPSHIRE- MANCHESTER, NASHUA · 1x1 Convolution · Convolution · Average Pooling · Local Response Normalization · Auxiliary Classifier · Inception Module · Grouped Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Dropout
