Fast Trainable Projection for Robust Fine-Tuning

Junjiao Tian; Yen-Cheng Liu; James Seale Smith; Zsolt Kira

arXiv:2310.19182·cs.CV·October 31, 2023·1 cites

Fast Trainable Projection for Robust Fine-Tuning

Junjiao Tian, Yen-Cheng Liu, James Seale Smith, Zsolt Kira

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces Fast Trainable Projection (FTP), a scalable and efficient projection-based fine-tuning method that enhances robustness and speed in transfer learning tasks across various vision benchmarks.

Contribution

FTP provides a novel, computationally efficient projection-based fine-tuning algorithm that improves robustness and speed, and can be integrated with existing optimizers in a plug-and-play manner.

Findings

01

FTP achieves 35% speedup over prior methods.

02

FTP improves robustness on OOD datasets across multiple vision tasks.

03

FTP is adaptable to low-label and continual learning scenarios.

Abstract

Robust fine-tuning aims to achieve competitive in-distribution (ID) performance while maintaining the out-of-distribution (OOD) robustness of a pre-trained model when transferring it to a downstream task. Recently, projected gradient descent has been successfully used in robust fine-tuning by constraining the deviation from the initialization of the fine-tuned model explicitly through projection. However, algorithmically, two limitations prevent this method from being adopted more widely, scalability and efficiency. In this paper, we propose a new projection-based fine-tuning algorithm, Fast Trainable Projection (FTP) for computationally efficient learning of per-layer projection constraints, resulting in an average $35%$ speedup on our benchmarks compared to prior works. FTP can be combined with existing optimizers such as AdamW, and be used in a plug-and-play fashion. Finally, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Fast Trainable Projection for Robust Fine-tuning· slideslive

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications