PatchCensor: Patch Robustness Certification for Transformers via   Exhaustive Testing

Yuheng Huang; Lei Ma; Yuanchun Li

arXiv:2111.10481·cs.CV·April 25, 2023·5 cites

PatchCensor: Patch Robustness Certification for Transformers via Exhaustive Testing

Yuheng Huang, Lei Ma, Yuanchun Li

PDF

Open Access

TL;DR

PatchCensor certifies the robustness of Vision Transformers against adversarial patches through exhaustive testing, providing statistical guarantees without retraining models, thus enhancing safety in critical applications.

Contribution

It introduces a novel certification method for ViT robustness against patches using exhaustive testing and voting over mutated attention masks, avoiding extensive retraining.

Findings

01

Achieves 67.1% certified accuracy on ImageNet for 2% adversarial patches.

02

Outperforms state-of-the-art robustness certification techniques.

03

Maintains high clean accuracy of 81.8% on ImageNet.

Abstract

Vision Transformer (ViT) is known to be highly nonlinear like other classical neural networks and could be easily fooled by both natural and adversarial patch perturbations. This limitation could pose a threat to the deployment of ViT in the real industrial environment, especially in safety-critical scenarios. In this work, we propose PatchCensor, aiming to certify the patch robustness of ViT by applying exhaustive testing. We try to provide a provable guarantee by considering the worst patch attack scenarios. Unlike empirical defenses against adversarial patches that may be adaptively breached, certified robust approaches can provide a certified accuracy against arbitrary attacks under certain conditions. However, existing robustness certifications are mostly based on robust training, which often requires substantial training efforts and the sacrifice of model performance on normal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Advanced Neural Network Applications

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Residual Connection · Dense Connections · Softmax