EfficientTrain++: Generalized Curriculum Learning for Efficient Visual Backbone Training
Yulin Wang, Yang Yue, Rui Lu, Yizeng Han, Shiji Song, Gao Huang

TL;DR
EfficientTrain++ introduces a generalized curriculum learning approach that accelerates training of visual backbones by progressively exposing models to more complex features through frequency-based input modulation, significantly reducing training time.
Contribution
This work generalizes curriculum learning for visual models by using a soft-selection of features via Fourier spectrum cropping, enabling faster training without accuracy loss.
Findings
Reduces training time by 1.5-3.0x on ImageNet datasets.
Effective in supervised and self-supervised learning scenarios.
Maintains model accuracy despite faster training.
Abstract
The superior performance of modern visual backbones usually comes with a costly training procedure. We contribute to this issue by generalizing the idea of curriculum learning beyond its original formulation, i.e., training models using easier-to-harder data. Specifically, we reformulate the training curriculum as a soft-selection function, which uncovers progressively more difficult patterns within each example during training, instead of performing easier-to-harder sample selection. Our work is inspired by an intriguing observation on the learning dynamics of visual backbones: during the earlier stages of training, the model predominantly learns to recognize some 'easier-to-learn' discriminative patterns in the data. These patterns, when observed through frequency and spatial domains, incorporate lower-frequency components, and the natural image contents without distortion or data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Processing Techniques and Applications
