Towards Efficient Visual Adaption via Structural Re-parameterization
Gen Luo, Minglang Huang, Yiyi Zhou, Xiaoshuai Sun, Guannan Jiang,, Zhiyu Wang, Rongrong Ji

TL;DR
This paper introduces RepAdapter, a structural re-parameterization method that enables parameter-efficient and inference-friendly adaptation of large vision models, significantly reducing latency and resource costs while maintaining high performance.
Contribution
The paper proposes RepAdapter, a novel structural re-parameterization approach that allows seamless integration into giant vision models for zero-cost inference and improved efficiency.
Findings
Outperforms state-of-the-art PETL methods on 27 vision benchmarks.
Reduces training time by up to 25% and GPU memory by 20%.
Achieves 7.2% higher accuracy than full tuning on average.
Abstract
Parameter-efficient transfer learning (PETL) is an emerging research spot aimed at inexpensively adapting large-scale pre-trained models to downstream tasks. Recent advances have achieved great success in saving storage costs for various pre-trained models by updating a small number of parameters instead of full tuning. However, we notice that most existing PETL methods still incur non-negligible latency during inference. In this paper, we propose a parameter-efficient and computational friendly adapter for giant vision models, called RepAdapter. Specifically, we first prove that common adaptation modules can also be seamlessly integrated into most giant vision models via our structural re-parameterization, thereby achieving zero-cost during inference. We then investigate the sparse design and effective placement of adapter structure, helping our RepAdaper obtain other advantages in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Human Pose and Action Recognition
MethodsAdapter · Contrastive Language-Image Pre-training
