ViTGuard: Attention-aware Detection against Adversarial Examples for Vision Transformer
Shihua Sun, Kenechukwu Nwodo, Shridatt Sugrim, Angelos Stavrou,, Haining Wang

TL;DR
ViTGuard is a novel detection framework that leverages masked autoencoders and ViT-specific features to effectively identify adversarial examples, including patch attacks, without prior exposure to adversarial samples.
Contribution
This paper introduces ViTGuard, the first general detection method tailored for Vision Transformers, capable of defending against both full-image and patch adversarial attacks.
Findings
ViTGuard outperforms seven existing detection methods across multiple datasets and attack types.
It maintains robustness against adaptive evasion attacks.
The method effectively detects unseen adversarial attacks without adversarial training.
Abstract
The use of transformers for vision tasks has challenged the traditional dominant role of convolutional neural networks (CNN) in computer vision (CV). For image classification tasks, Vision Transformer (ViT) effectively establishes spatial relationships between patches within images, directing attention to important areas for accurate predictions. However, similar to CNNs, ViTs are vulnerable to adversarial attacks, which mislead the image classifier into making incorrect decisions on images with carefully designed perturbations. Moreover, adversarial patch attacks, which introduce arbitrary perturbations within a small area, pose a more serious threat to ViTs. Even worse, traditional detection methods, originally designed for CNN models, are impractical or suffer significant performance degradation when applied to ViTs, and they generally overlook patch attacks. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · CCD and CMOS Imaging Sensors · Anomaly Detection Techniques and Applications
MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Vision Transformer · Softmax · Layer Normalization · Dropout
