Does Robustness on ImageNet Transfer to Downstream Tasks?
Yutaro Yamada, Mayu Otani

TL;DR
This paper investigates whether robustness gained on ImageNet transfers effectively to downstream tasks like object detection, segmentation, and CIFAR10 classification, revealing architecture and task-dependent transferability of robustness.
Contribution
It demonstrates that robustness transferability varies by architecture and task, with dense prediction models transferring robustness better than CNNs, and that robustness does not always persist after fine-tuning.
Findings
Swin Transformer transfers robustness better than CNNs for dense prediction tasks.
Robust ImageNet models do not retain robustness after fine-tuning on CIFAR10.
Network architecture significantly influences robustness transferability.
Abstract
As clean ImageNet accuracy nears its ceiling, the research community is increasingly more concerned about robust accuracy under distributional shifts. While a variety of methods have been proposed to robustify neural networks, these techniques often target models trained on ImageNet classification. At the same time, it is a common practice to use ImageNet pretrained backbones for downstream tasks such as object detection, semantic segmentation, and image classification from different domains. This raises a question: Can these robust image classifiers transfer robustness to downstream tasks? For object detection and semantic segmentation, we find that a vanilla Swin Transformer, a variant of Vision Transformer tailored for dense prediction tasks, transfers robustness better than Convolutional Neural Networks that are trained to be robust to the corrupted version of ImageNet. For CIFAR10…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · COVID-19 diagnosis using AI
MethodsAttention Is All You Need · Linear Layer · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Dense Connections · Multi-Head Attention · Stochastic Depth · Dropout · Layer Normalization · Softmax
