Patch-Fool: Are Vision Transformers Always Robust Against Adversarial Perturbations?
Yonggan Fu, Shunyao Zhang, Shang Wu, Cheng Wan, Yingyan Celine Lin

TL;DR
This paper challenges the assumption that vision transformers are inherently more robust than CNNs against adversarial attacks, revealing their vulnerabilities through a novel patch-based attack framework called Patch-Fool.
Contribution
It introduces Patch-Fool, a new attack method targeting ViTs' self-attention, and demonstrates that ViTs can be more vulnerable than CNNs under certain perturbations.
Findings
ViTs are not always more robust than CNNs against adversarial attacks.
Patch-Fool effectively fools ViTs by attacking individual patches.
Perturbation density and strength are key factors affecting robustness.
Abstract
Vision transformers (ViTs) have recently set off a new wave in neural architecture design thanks to their record-breaking performance in various vision tasks. In parallel, to fulfill the goal of deploying ViTs into real-world vision applications, their robustness against potential malicious attacks has gained increasing attention. In particular, recent works show that ViTs are more robust against adversarial attacks as compared with convolutional neural networks (CNNs), and conjecture that this is because ViTs focus more on capturing global interactions among different input/feature patches, leading to their improved robustness to local perturbations imposed by adversarial attacks. In this work, we ask an intriguing question: "Under what kinds of perturbations do ViTs become more vulnerable learners compared to CNNs?" Driven by this question, we first conduct a comprehensive experiment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Advanced Neural Network Applications
