PatchCensor: Patch Robustness Certification for Transformers via Exhaustive Testing
Yuheng Huang, Lei Ma, Yuanchun Li

TL;DR
PatchCensor certifies the robustness of Vision Transformers against adversarial patches through exhaustive testing, providing statistical guarantees without retraining models, thus enhancing safety in critical applications.
Contribution
It introduces a novel certification method for ViT robustness against patches using exhaustive testing and voting over mutated attention masks, avoiding extensive retraining.
Findings
Achieves 67.1% certified accuracy on ImageNet for 2% adversarial patches.
Outperforms state-of-the-art robustness certification techniques.
Maintains high clean accuracy of 81.8% on ImageNet.
Abstract
Vision Transformer (ViT) is known to be highly nonlinear like other classical neural networks and could be easily fooled by both natural and adversarial patch perturbations. This limitation could pose a threat to the deployment of ViT in the real industrial environment, especially in safety-critical scenarios. In this work, we propose PatchCensor, aiming to certify the patch robustness of ViT by applying exhaustive testing. We try to provide a provable guarantee by considering the worst patch attack scenarios. Unlike empirical defenses against adversarial patches that may be adaptively breached, certified robust approaches can provide a certified accuracy against arbitrary attacks under certain conditions. However, existing robustness certifications are mostly based on robust training, which often requires substantial training efforts and the sacrifice of model performance on normal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Advanced Neural Network Applications
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Adam · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Label Smoothing · Residual Connection · Dense Connections · Softmax
