Skip Tuning: Pre-trained Vision-Language Models are Effective and   Efficient Adapters Themselves

Shihan Wu; Ji Zhang; Pengpeng Zeng; Lianli Gao; Jingkuan Song; Heng; Tao Shen

arXiv:2412.11509·cs.CV·March 17, 2025

Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves

Shihan Wu, Ji Zhang, Pengpeng Zeng, Lianli Gao, Jingkuan Song, Heng, Tao Shen

PDF

Open Access 1 Repo

TL;DR

Skip Tuning is a new method for adapting pre-trained vision-language models that reduces computational complexity by skipping certain layers and class-wise features, outperforming prompt tuning and adapters.

Contribution

This work introduces Skip Tuning, a novel adaptation paradigm that improves transfer efficiency without extra modules or context vectors, unlike existing prompt tuning methods.

Findings

01

Skip Tuning outperforms prompt tuning and adapter-based methods.

02

It achieves better transferability and efficiency across various benchmarks.

03

The method reduces feature-gradient flow complexity effectively.

Abstract

Prompt tuning (PT) has long been recognized as an effective and efficient paradigm for transferring large pre-trained vision-language models (VLMs) to downstream tasks by learning a tiny set of context vectors. Nevertheless, in this work, we reveal that freezing the parameters of VLMs during learning the context vectors neither facilitates the transferability of pre-trained knowledge nor improves the memory and time efficiency significantly. Upon further investigation, we find that reducing both the length and width of the feature-gradient propagation flows of the full fine-tuning (FT) baseline is key to achieving effective and efficient knowledge transfer. Motivated by this, we propose Skip Tuning, a novel paradigm for adapting VLMs to downstream tasks. Unlike existing PT or adapter-based methods, Skip Tuning applies Layer-wise Skipping (LSkip) and Class-wise Skipping (CSkip) upon the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

koorye/skiptuning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications

MethodsSparse Evolutionary Training · Adapter