Efficient Adversarial Training via Criticality-Aware Fine-Tuning
Wenyun Li, Zheng Zhang, Dongmei Jiang, Yaowei Wang, Xiangyuan Lan

TL;DR
This paper introduces CAAT, a method for efficiently fine-tuning only the most critical parameters of Vision Transformers to improve adversarial robustness while significantly reducing computational costs.
Contribution
CAAT adaptively identifies and fine-tunes only the most robustness-critical parameters, enabling scalable adversarial training with fewer trainable parameters.
Findings
CAAT achieves comparable robustness with only 6% of parameters tuned.
CAAT incurs just a 4.3% decrease in robustness compared to full adversarial training.
Outperforms state-of-the-art lightweight adversarial training methods on multiple datasets.
Abstract
Vision Transformer (ViT) models have achieved remarkable performance across various vision tasks, with scalability being a key advantage when applied to large datasets. This scalability enables ViT models to exhibit strong generalization capabilities. However, as the number of parameters increases, the robustness of ViT models to adversarial examples does not scale proportionally. Adversarial training (AT), one of the most effective methods for enhancing robustness, typically requires fine-tuning the entire model, leading to prohibitively high computational costs, especially for large ViT architectures. In this paper, we aim to robustly fine-tune only a small subset of parameters to achieve robustness comparable to standard AT. To accomplish this, we introduce Criticality-Aware Adversarial Training (CAAT), a novel method that adaptively allocates resources to the most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
