Fixing the train-test resolution discrepancy
Hugo Touvron, Andrea Vedaldi, Matthijs Douze, Herv\'e J\'egou

TL;DR
This paper addresses the resolution discrepancy between training and testing in image classification, proposing a simple fine-tuning method to improve performance, enabling training with smaller images and achieving state-of-the-art results.
Contribution
It introduces an efficient fine-tuning strategy to optimize classifier performance across different resolutions, allowing effective training with smaller images and achieving high accuracy on ImageNet.
Findings
Lower train resolution can improve test accuracy.
Test resolution fine-tuning enhances classifier performance.
Achieved state-of-the-art ImageNet accuracy with resolution optimization.
Abstract
Data-augmentation is key to the training of neural networks for image classification. This paper first shows that existing augmentations induce a significant discrepancy between the typical size of the objects seen by the classifier at train and test time. We experimentally validate that, for a target test resolution, using a lower train resolution offers better classification at test time. We then propose a simple yet effective and efficient strategy to optimize the classifier performance when the train and test resolutions differ. It involves only a computationally cheap fine-tuning of the network at the test resolution. This enables training strong classifiers using small training images. For instance, we obtain 77.1% top-1 accuracy on ImageNet with a ResNet-50 trained on 128x128 images, and 79.8% with one trained on 224x224 image. In addition, if we use extra training data we get…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsAverage Pooling · ResNeXt Block · Grouped Convolution · Bottleneck Residual Block · Global Average Pooling · Residual Block · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · Kaiming Initialization · Max Pooling
