Bag of Tricks for Image Classification with Convolutional Neural Networks
Tong He, Zhi Zhang, Hang Zhang, Zhongyue Zhang, Junyuan Xie, Mu Li

TL;DR
This paper systematically evaluates various training refinements for CNNs in image classification, demonstrating significant accuracy improvements and better transfer learning performance.
Contribution
It provides an empirical analysis of training tricks for CNNs, showing their combined effect on improving accuracy and transfer learning.
Findings
ResNet-50 accuracy improved from 75.3% to 79.29% on ImageNet
Combining training tricks yields significant accuracy gains
Improved models enhance transfer learning in detection and segmentation
Abstract
Much of the recent progress made in image classification research can be credited to training procedure refinements, such as changes in data augmentations and optimization methods. In the literature, however, most refinements are either briefly mentioned as implementation details or only visible in source code. In this paper, we will examine a collection of such refinements and empirically evaluate their impact on the final model accuracy through ablation study. We will show that, by combining these refinements together, we are able to improve various CNN models significantly. For example, we raise ResNet-50's top-1 validation accuracy from 75.3% to 79.29% on ImageNet. We will also demonstrate that improvement on image classification accuracy leads to better transfer learning performance in other application domains such as object detection and semantic segmentation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗glasses/cse_resnet50model· 1 dl1 dl
- 🤗glasses/dummymodel· 6 dl6 dl
- 🤗glasses/eca_resnet26tmodel· 30 dl30 dl
- 🤗glasses/resnet152model· 32 dl32 dl
- 🤗glasses/resnet18model· 49 dl49 dl
- 🤗glasses/resnet26model· 32 dl32 dl
- 🤗glasses/resnet26dmodel· 28 dl28 dl
- 🤗glasses/resnet34model· 40 dl40 dl
- 🤗glasses/resnet34dmodel· 35 dl35 dl
- 🤗glasses/resnet50model· 37 dl37 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsResidual Connection · Bottleneck Residual Block · Global Average Pooling · Residual Block · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Average Pooling · 1x1 Convolution · Convolution · Batch Normalization
