Offsite-Tuning: Transfer Learning without Full Model
Guangxuan Xiao, Ji Lin, Song Han

TL;DR
Offsite-Tuning introduces a privacy-preserving, efficient transfer learning method that enables adaptation of large foundation models without full model access, achieving comparable accuracy with significant speed and memory improvements.
Contribution
The paper presents Offsite-Tuning, a novel framework allowing transfer learning without full model access, reducing computational costs and privacy risks.
Findings
Achieves comparable accuracy to full fine-tuning.
Provides 6.5x speedup and 5.6x memory reduction.
Works effectively on large language and vision models.
Abstract
Transfer learning is important for foundation models to adapt to downstream tasks. However, many foundation models are proprietary, so users must share their data with model owners to fine-tune the models, which is costly and raise privacy concerns. Moreover, fine-tuning large foundation models is computation-intensive and impractical for most downstream users. In this paper, we propose Offsite-Tuning, a privacy-preserving and efficient transfer learning framework that can adapt billion-parameter foundation models to downstream data without access to the full model. In offsite-tuning, the model owner sends a light-weight adapter and a lossy compressed emulator to the data owner, who then fine-tunes the adapter on the downstream data with the emulator's assistance. The fine-tuned adapter is then returned to the model owner, who plugs it into the full model to create an adapted foundation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Privacy-Preserving Technologies in Data
MethodsAdapter
