A Comprehensive Study on Robustness of Image Classification Models:   Benchmarking and Rethinking

Chang Liu; Yinpeng Dong; Wenzhao Xiang; Xiao Yang; Hang Su; Jun Zhu,; Yuefeng Chen; Yuan He; Hui Xue; Shibao Zheng

arXiv:2302.14301·cs.CV·March 1, 2023·6 cites

A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking

Chang Liu, Yinpeng Dong, Wenzhao Xiang, Xiao Yang, Hang Su, Jun Zhu,, Yuefeng Chen, Yuan He, Hui Xue, Shibao Zheng

PDF

Open Access

TL;DR

This paper introduces ARES-Bench, a comprehensive robustness benchmark for image classification models, evaluating 55 models across various attacks and datasets, revealing key trade-offs and improvements in robustness techniques.

Contribution

The paper establishes a large-scale robustness benchmark for image classification, providing extensive evaluations and insights into model robustness, and achieves state-of-the-art adversarial robustness through optimized training settings.

Findings

01

Inherent trade-off between adversarial and natural robustness.

02

Adversarial training enhances robustness, especially for Transformers.

03

Pre-training improves natural robustness with more data or self-supervised learning.

Abstract

The robustness of deep neural networks is usually lacking under adversarial examples, common corruptions, and distribution shifts, which becomes an important research problem in the development of deep learning. Although new deep learning methods and robustness improvement techniques have been constantly proposed, the robustness evaluations of existing methods are often inadequate due to their rapid development, diverse noise patterns, and simple evaluation metrics. Without thorough robustness evaluations, it is hard to understand the advances in the field and identify the effective methods. In this paper, we establish a comprehensive robustness benchmark called \textbf{ARES-Bench} on the image classification task. In our benchmark, we evaluate the robustness of 55 typical deep learning models on ImageNet with diverse architectures (e.g., CNNs, Transformers) and learning algorithms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Anomaly Detection Techniques and Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Label Smoothing · Softmax · Adam · Layer Normalization · Residual Connection · Dense Connections