On Robustness and Transferability of Convolutional Neural Networks

Josip Djolonga; Jessica Yung; Michael Tschannen; Rob Romijnders; Lucas; Beyer; Alexander Kolesnikov; Joan Puigcerver; Matthias Minderer; Alexander; D'Amour; Dan Moldovan; Sylvain Gelly; Neil Houlsby; Xiaohua Zhai; Mario Lucic

arXiv:2007.08558·cs.CV·March 24, 2021

On Robustness and Transferability of Convolutional Neural Networks

Josip Djolonga, Jessica Yung, Michael Tschannen, Rob Romijnders, Lucas, Beyer, Alexander Kolesnikov, Joan Puigcerver, Matthias Minderer, Alexander, D'Amour, Dan Moldovan, Sylvain Gelly, Neil Houlsby, Xiaohua Zhai, Mario Lucic

PDF

1 Repo 1 Datasets

TL;DR

This paper investigates how modern CNNs perform under distributional shifts and transfer learning, revealing that larger models and data, along with simple preprocessing changes, enhance robustness and transferability.

Contribution

It provides a systematic analysis of factors affecting CNN robustness, introduces a synthetic dataset for evaluation, and highlights the impact of data size, model scale, and preprocessing.

Findings

01

Increasing training data and model size improves robustness.

02

Simple preprocessing changes can significantly enhance transferability.

03

A new synthetic dataset enables systematic robustness evaluation.

Abstract

Modern deep convolutional networks (CNNs) are often criticized for not generalizing under distributional shifts. However, several recent breakthroughs in transfer learning suggest that these networks can cope with severe distribution shifts and successfully adapt to new tasks from a few training examples. In this work we study the interplay between out-of-distribution and transfer performance of modern image classification CNNs for the first time and investigate the impact of the pre-training data size, the model scale, and the data preprocessing pipeline. We find that increasing both the training set and model sizes significantly improve the distributional shift robustness. Furthermore, we show that, perhaps surprisingly, simple changes in the preprocessing such as modifying the image resolution can significantly mitigate robustness issues in some cases. Finally, we outline the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/si-score
tf

Datasets

tillspeicher/transforms_2d_base
dataset· 33 dl
33 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.