Scale Calibrated Training: Improving Generalization of Deep Networks via   Scale-Specific Normalization

Zhuoran Yu; Aojun Zhou; Yukun Ma; Yudian Li; Xiaohan Zhang; Ping Luo

arXiv:1909.00182·cs.CV·September 8, 2020·1 cites

Scale Calibrated Training: Improving Generalization of Deep Networks via Scale-Specific Normalization

Zhuoran Yu, Aojun Zhou, Yukun Ma, Yudian Li, Xiaohan Zhang, Ping Luo

PDF

Open Access

TL;DR

This paper introduces Scale Calibrated Training (SCT), a novel method enabling CNNs to generalize across multiple image scales during testing, significantly improving accuracy on low-resolution images.

Contribution

The paper proposes SCT and a new normalization scheme called Scale-Specific Batch Normalization, enhancing CNN robustness to varying input scales.

Findings

01

SCT improves ResNet-50 accuracy by 1.7% on 224-sized images.

02

SCT improves ResNet-50 accuracy by 11.5% on 128-sized images.

03

Scale-Specific Batch Normalization outperforms vanilla batch normalization in SCT.

Abstract

Standard convolutional neural networks(CNNs) require consistent image resolutions in both training and testing phase. However, in practice, testing with smaller image sizes is necessary for fast inference. We show that trivially evaluating low-resolution images on networks trained with high-resolution images results in a catastrophic accuracy drop in standard CNN architectures. We propose a novel training regime called Scale calibrated Training(SCT) which allows networks to learn from various scales of input simultaneously. By taking advantages of SCT, single network can provide decent accuracy at test time in response to multiple test scales. In our analysis, we surprisingly find that vanilla batch normalization can lead to sub-optimal performance in SCT. Therefore, a novel normalization scheme called Scale-Specific Batch Normalization is equipped to SCT in replacement of batch…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Generative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning