An Impartial Take to the CNN vs Transformer Robustness Contest

Francesco Pinto; Philip H.S. Torr; Puneet K. Dokania

arXiv:2207.11347·cs.CV·July 26, 2022·6 cites

An Impartial Take to the CNN vs Transformer Robustness Contest

Francesco Pinto, Philip H.S. Torr, Puneet K. Dokania

PDF

Open Access

TL;DR

This paper empirically compares CNNs and Transformers in terms of robustness and reliability, showing that recent CNNs like ConvNeXt can match or surpass Transformers, with no clear overall winner.

Contribution

It provides a comprehensive empirical analysis demonstrating that modern CNNs are as robust and reliable as Transformers, challenging the common belief of Transformers' superiority.

Findings

01

Recent CNNs can be as robust as Transformers

02

No architecture consistently outperforms the other

03

Both architectures share similar vulnerabilities

Abstract

Following the surge of popularity of Transformers in Computer Vision, several studies have attempted to determine whether they could be more robust to distribution shifts and provide better uncertainty estimates than Convolutional Neural Networks (CNNs). The almost unanimous conclusion is that they are, and it is often conjectured more or less explicitly that the reason of this supposed superiority is to be attributed to the self-attention mechanism. In this paper we perform extensive empirical analyses showing that recent state-of-the-art CNNs (particularly, ConvNeXt) can be as robust and reliable or even sometimes more than the current state-of-the-art Transformers. However, there is no clear winner. Therefore, although it is tempting to state the definitive superiority of one family of architectures over another, they seem to enjoy similar extraordinary performances on a variety of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications