Fast Trainable Projection for Robust Fine-Tuning
Junjiao Tian, Yen-Cheng Liu, James Seale Smith, Zsolt Kira

TL;DR
This paper introduces Fast Trainable Projection (FTP), a scalable and efficient projection-based fine-tuning method that enhances robustness and speed in transfer learning tasks across various vision benchmarks.
Contribution
FTP provides a novel, computationally efficient projection-based fine-tuning algorithm that improves robustness and speed, and can be integrated with existing optimizers in a plug-and-play manner.
Findings
FTP achieves 35% speedup over prior methods.
FTP improves robustness on OOD datasets across multiple vision tasks.
FTP is adaptable to low-label and continual learning scenarios.
Abstract
Robust fine-tuning aims to achieve competitive in-distribution (ID) performance while maintaining the out-of-distribution (OOD) robustness of a pre-trained model when transferring it to a downstream task. Recently, projected gradient descent has been successfully used in robust fine-tuning by constraining the deviation from the initialization of the fine-tuned model explicitly through projection. However, algorithmically, two limitations prevent this method from being adopted more widely, scalability and efficiency. In this paper, we propose a new projection-based fine-tuning algorithm, Fast Trainable Projection (FTP) for computationally efficient learning of per-layer projection constraints, resulting in an average speedup on our benchmarks compared to prior works. FTP can be combined with existing optimizers such as AdamW, and be used in a plug-and-play fashion. Finally, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
