Are Fast Methods Stable in Adversarially Robust Transfer Learning?

Joshua C. Zhao; Saurabh Bagchi

arXiv:2506.22602·cs.LG·July 1, 2025

Are Fast Methods Stable in Adversarially Robust Transfer Learning?

Joshua C. Zhao, Saurabh Bagchi

PDF

Open Access

TL;DR

This paper investigates the stability and efficiency of using the fast gradient sign method (FGSM) for adversarially robust transfer learning, finding it more stable and faster than PGD with minimal robustness loss.

Contribution

The study reveals that FGSM is more stable and computationally efficient than PGD in adversarial transfer learning, especially with parameter-efficient fine-tuning methods.

Findings

01

FGSM does not suffer from catastrophic overfitting at standard perturbation budgets.

02

FGSM remains stable up to higher perturbation levels with parameter-efficient fine-tuning.

03

FGSM achieves comparable robustness to PGD while requiring four times less training time.

Abstract

Transfer learning is often used to decrease the computational cost of model training, as fine-tuning a model allows a downstream task to leverage the features learned from the pre-training dataset and quickly adapt them to a new task. This is particularly useful for achieving adversarial robustness, as adversarially training models from scratch is very computationally expensive. However, high robustness in transfer learning still requires adversarial training during the fine-tuning phase, which requires up to an order of magnitude more time than standard fine-tuning. In this work, we revisit the use of the fast gradient sign method (FGSM) in robust transfer learning to improve the computational cost of adversarial fine-tuning. We surprisingly find that FGSM is much more stable in adversarial fine-tuning than when training from scratch. In particular, FGSM fine-tuning does not suffer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Generative Adversarial Networks and Image Synthesis