Offsite-Tuning: Transfer Learning without Full Model

Guangxuan Xiao; Ji Lin; Song Han

arXiv:2302.04870 · cs.CL · February 10, 2023 · 21 citations

TL;DR

Offsite-Tuning introduces a privacy-preserving, efficient transfer learning method that enables adaptation of large foundation models without full model access, achieving comparable accuracy with significant speed and memory improvements.

Contribution

The paper presents Offsite-Tuning, a novel framework allowing transfer learning without full model access, reducing computational costs and privacy risks.

Findings

1. Achieves comparable accuracy to full fine-tuning.
2. Provides 6.5x speedup and 5.6x memory reduction.
3. Works effectively on large language and vision models.

Abstract

Transfer learning is important for foundation models to adapt to downstream tasks. However, many foundation models are proprietary, so users must share their data with model owners to fine-tune the models, which is costly and raises privacy concerns. Moreover, fine-tuning large foundation models is computation-intensive and impractical for most downstream users. In this paper, we propose Offsite-Tuning, a privacy-preserving and efficient transfer learning framework that can adapt billion-parameter foundation models to downstream data without access to the full model. In offsite-tuning, the model owner sends a light-weight adapter and a lossy compressed emulator to the data owner, who then fine-tunes the adapter on the downstream data with the emulator's assistance. The fine-tuned adapter is then returned to the model owner, who plugs it into the full model to create an adapted foundation…
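The exchange described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the scalar residual "layers", the choice of the first and last two layers as the adapter, and the use of layer dropping as the lossy compression are all assumptions made here for illustration, as are the names `forward`, `mse`, and `finetune_adapter`.

```python
from math import prod

def forward(weights, x):
    """Toy 'model': a stack of scalar residual layers, h <- h + w * h."""
    h = x
    for w in weights:
        h = h + w * h
    return h

def mse(weights, data):
    """Mean squared error of the toy model on (x, y) pairs."""
    return sum((forward(weights, x) - y) ** 2 for x, y in data) / len(data)

# Model owner: split the full model into a small adapter and a frozen middle.
# ASSUMPTION: adapter = first two + last two layers (hypothetical split).
full = [0.1, 0.1, 0.02, 0.02, 0.02, 0.02, 0.1, 0.1]
bottom, middle, top = full[:2], full[2:6], full[6:]
# ASSUMPTION: lossy compression by keeping every other middle layer.
emulator = middle[::2]

# Data owner: private downstream data that never leaves the data owner.
data = [(x, 3.0 * x) for x in (0.5, 1.0, 1.5, 2.0)]

def finetune_adapter(bottom, emulator, top, data, lr=0.01, steps=200):
    """Gradient descent on the adapter weights only; the emulator is frozen."""
    adapter = list(bottom) + list(top)
    n_bottom = len(bottom)
    for _ in range(steps):
        stack = adapter[:n_bottom] + emulator + adapter[n_bottom:]
        # The toy model is linear in x: output = scale * x.
        scale = prod(1.0 + w for w in stack)
        # dL/dscale for L = mean((scale * x - y)^2)
        dscale = sum(2.0 * (scale * x - y) * x for x, y in data) / len(data)
        # Chain rule: dscale/dw_i = scale / (1 + w_i)
        adapter = [w - lr * dscale * scale / (1.0 + w) for w in adapter]
    return adapter[:n_bottom], adapter[n_bottom:]

new_bottom, new_top = finetune_adapter(bottom, emulator, top, data)

# Model owner: plug the returned adapter back into the full model.
adapted = new_bottom + middle + new_top

zero_shot_loss = mse(full, data)                            # original full model
emulator_loss = mse(new_bottom + emulator + new_top, data)  # what the data owner sees
plug_in_loss = mse(adapted, data)                           # adapter + full middle
print(zero_shot_loss, emulator_loss, plug_in_loss)
```

In this toy setting the data owner only ever holds `bottom`, `top`, and `emulator`, and the model owner only ever receives the tuned adapter weights. Because the emulator is lossy, the plug-in loss is not exactly the loss the data owner observed during fine-tuning; the gap between the two depends on how closely the emulator approximates the frozen middle.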

Code & Models

Repositories

mit-han-lab/offsite-tuning (PyTorch, official)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Privacy-Preserving Technologies in Data

Methods: Adapter