AdapterTune: Zero-Initialized Low-Rank Adapters for Frozen Vision Transformers
Salim Khazem

TL;DR
AdapterTune introduces zero-initialized low-rank adapters for frozen Vision Transformers, enhancing transfer learning stability and efficiency by starting from the pretrained state and providing a principled capacity control.
Contribution
It proposes a novel residual low-rank adapter design with zero initialization, formalizes adapter rank as a capacity measure, and demonstrates significant accuracy gains with minimal additional parameters.
Findings
Improves top-1 accuracy by +14.9 points on average over head-only transfer.
Outperforms full fine-tuning on 10 of 15 dataset-backbone pairs.
Achieves these results with only 0.92 of the parameters used in full fine-tuning.
Abstract
Frozen-backbone transfer with Vision Transformers faces two under-addressed issues: optimization instability when adapters are naively inserted into a fixed feature extractor, and the absence of principled guidance for setting adapter capacity. We introduce AdapterTune, which augments each transformer block with a residual low-rank bottleneck whose up-projection is zero-initialized, guaranteeing that the adapted network starts exactly at the pretrained function and eliminates early-epoch representation drift. On the analytical side, we formalize adapter rank as a capacity budget for approximating downstream task shifts in feature space. The resulting excess-risk decomposition predicts monotonic but diminishing accuracy gains with increasing rank, an ``elbow'' behavior we confirm through controlled sweeps. We evaluate on 9 datasets and 3 backbone scales with multi-seed reporting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Advanced Neural Network Applications · Image Enhancement Techniques
