RBFormer: Improve Adversarial Robustness of Transformer by Robust Bias
Hao Cheng, Jinhao Duan, Hui Li, Lyutianyang Zhang, Jiahang Cao, Ping, Wang, Jize Zhang, Kaidi Xu, Renjing Xu

TL;DR
RBFormer is a novel Transformer-based structure that enhances adversarial robustness by increasing high-frequency robust biases, significantly improving performance on CIFAR-10 and ImageNet-1k datasets.
Contribution
The paper introduces RBFormer, a Transformer structure with increased high-frequency biases, demonstrating improved adversarial robustness without new defense mechanisms.
Findings
RBFormer outperforms baseline models with +16.12% on CIFAR-10.
RBFormer achieves +5.04% improvement on ImageNet-1k.
Enhanced robustness is achieved through structural bias adjustments.
Abstract
Recently, there has been a surge of interest and attention in Transformer-based structures, such as Vision Transformer (ViT) and Vision Multilayer Perceptron (VMLP). Compared with the previous convolution-based structures, the Transformer-based structure under investigation showcases a comparable or superior performance under its distinctive attention-based input token mixer strategy. Introducing adversarial examples as a robustness consideration has had a profound and detrimental impact on the performance of well-established convolution-based structures. This inherent vulnerability to adversarial attacks has also been demonstrated in Transformer-based structures. In this paper, our emphasis lies on investigating the intrinsic robustness of the structure rather than introducing novel defense measures against adversarial attacks. To address the susceptibility to robustness issues, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Bacillus and Francisella bacterial research
MethodsMulti-Head Attention · Attention Is All You Need · Layer Normalization · Label Smoothing · Dropout · Byte Pair Encoding · Absolute Position Encodings · Dense Connections · Linear Layer · Vision Transformer
