Does Robustness on ImageNet Transfer to Downstream Tasks?

Yutaro Yamada; Mayu Otani

arXiv:2204.03934 · cs.CV · April 11, 2022 · 1 citation

TL;DR

This paper investigates whether robustness gained on ImageNet transfers to downstream tasks such as object detection, semantic segmentation, and CIFAR10 classification, and finds that transferability depends on both the architecture and the task.

Contribution

It demonstrates that robustness transferability varies by architecture and task: on dense prediction tasks, a vanilla Swin Transformer transfers robustness better than CNNs trained to be robust to corrupted ImageNet, and robustness does not always persist after fine-tuning.

Findings

01

For object detection and semantic segmentation, a vanilla Swin Transformer transfers robustness better than CNNs trained to be robust to corrupted ImageNet.

02

Robust ImageNet models do not retain robustness after fine-tuning on CIFAR10.

03

Network architecture significantly influences robustness transferability.

Abstract

As clean ImageNet accuracy nears its ceiling, the research community is increasingly concerned about robust accuracy under distributional shifts. While a variety of methods have been proposed to robustify neural networks, these techniques often target models trained on ImageNet classification. At the same time, it is common practice to use ImageNet-pretrained backbones for downstream tasks such as object detection, semantic segmentation, and image classification from different domains. This raises a question: Can these robust image classifiers transfer robustness to downstream tasks? For object detection and semantic segmentation, we find that a vanilla Swin Transformer, a variant of Vision Transformer tailored for dense prediction tasks, transfers robustness better than Convolutional Neural Networks that are trained to be robust to the corrupted version of ImageNet. For CIFAR10…

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · COVID-19 diagnosis using AI

Methods: Attention Is All You Need · Linear Layer · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Dense Connections · Multi-Head Attention · Stochastic Depth · Dropout · Layer Normalization · Softmax