ResNet strikes back: An improved training procedure in timm
Ross Wightman, Hugo Touvron, Herv\'e J\'egou

TL;DR
This paper revisits ResNet-50 training, incorporating recent best practices in optimization and data augmentation, resulting in improved performance and better baseline models for future research.
Contribution
The authors provide a revised training procedure for ResNet-50 that leverages recent advances, offering improved benchmarks and open-source models in the timm library.
Findings
ResNet-50 achieves 80.4% top-1 accuracy on ImageNet with new training settings.
The improved training procedure enhances baseline performance for ResNet models.
Open-source pre-trained models are provided for future research use.
Abstract
The influential Residual Networks designed by He et al. remain the gold-standard architecture in numerous scientific publications. They typically serve as the default architecture in studies, or as baselines when new architectures are proposed. Yet there has been significant progress on best practices for training neural networks since the inception of the ResNet architecture in 2015. Novel optimization & data-augmentation have increased the effectiveness of the training recipes. In this paper, we re-evaluate the performance of the vanilla ResNet-50 when trained with a procedure that integrates such advances. We share competitive training settings and pre-trained models in the timm open-source library, with the hope that they will serve as better baselines for future work. For instance, with our more demanding training setting, a vanilla ResNet-50 reaches 80.4% top-1 accuracy at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗timm/efficientnet_b0.ra_in1kmodel· 1.0M dl· ♡ 51.0M dl♡ 5
- 🤗timm/efficientnet_b2.ra_in1kmodel· 220k dl220k dl
- 🤗timm/efficientnet_b3.ra2_in1kmodel· 115k dl· ♡ 5115k dl♡ 5
- 🤗timm/efficientnet_b4.ra2_in1kmodel· 121k dl121k dl
- 🤗timm/efficientnet_el.ra_in1kmodel· 252 dl252 dl
- 🤗timm/efficientnet_em.ra2_in1kmodel· 825 dl825 dl
- 🤗timm/efficientnet_es.ra_in1kmodel· 185 dl185 dl
- 🤗timm/efficientnet_lite0.ra_in1kmodel· 4.2k dl4.2k dl
- 🤗timm/efficientnetv2_rw_m.agc_in1kmodel· 56k dl· ♡ 156k dl♡ 1
- 🤗timm/efficientnetv2_rw_s.ra2_in1kmodel· 10k dl· ♡ 110k dl♡ 1
Videos
ResNet Strikes Back! | Patches Are All You Need? | Papers Explained· youtube
Taxonomy
TopicsAdvanced Neural Network Applications · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Residual Connection · Batch Normalization · 1x1 Convolution · Residual Block · Bottleneck Residual Block · Kaiming Initialization · Average Pooling · Global Average Pooling · Convolution
