ViTGuard: Attention-aware Detection against Adversarial Examples for   Vision Transformer

Shihua Sun; Kenechukwu Nwodo; Shridatt Sugrim; Angelos Stavrou,; Haining Wang

arXiv:2409.13828·cs.CV·September 24, 2024

ViTGuard: Attention-aware Detection against Adversarial Examples for Vision Transformer

Shihua Sun, Kenechukwu Nwodo, Shridatt Sugrim, Angelos Stavrou,, Haining Wang

PDF

Open Access

TL;DR

ViTGuard is a novel detection framework that leverages masked autoencoders and ViT-specific features to effectively identify adversarial examples, including patch attacks, without prior exposure to adversarial samples.

Contribution

This paper introduces ViTGuard, the first general detection method tailored for Vision Transformers, capable of defending against both full-image and patch adversarial attacks.

Findings

01

ViTGuard outperforms seven existing detection methods across multiple datasets and attack types.

02

It maintains robustness against adaptive evasion attacks.

03

The method effectively detects unseen adversarial attacks without adversarial training.

Abstract

The use of transformers for vision tasks has challenged the traditional dominant role of convolutional neural networks (CNN) in computer vision (CV). For image classification tasks, Vision Transformer (ViT) effectively establishes spatial relationships between patches within images, directing attention to important areas for accurate predictions. However, similar to CNNs, ViTs are vulnerable to adversarial attacks, which mislead the image classifier into making incorrect decisions on images with carefully designed perturbations. Moreover, adversarial patch attacks, which introduce arbitrary perturbations within a small area, pose a more serious threat to ViTs. Even worse, traditional detection methods, originally designed for CNN models, are impractical or suffer significant performance degradation when applied to ViTs, and they generally overlook patch attacks. In this paper, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · CCD and CMOS Imaging Sensors · Anomaly Detection Techniques and Applications

MethodsAttention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Vision Transformer · Softmax · Layer Normalization · Dropout