Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision
Minglei Li, Peng Ye, Yongqi Huang, Lin Zhang, Tao Chen, Tong He,, Jiayuan Fan, Wanli Ouyang

TL;DR
Adapter-X is a new parameter-efficient fine-tuning framework for vision tasks that outperforms full fine-tuning in 2D and 3D modalities with minimal trainable parameters by combining dynamic adapter sharing and block-specific designs.
Contribution
It introduces Adapter-X, a novel framework that integrates token-level dynamic allocation, parameter sharing, and block-specific modules for improved efficiency and performance.
Findings
Outperforms full fine-tuning in 2D and 3D tasks
Uses only 0.20% and 1.88% of trainable parameters
Demonstrates significant efficiency and generalization improvements
Abstract
Parameter-efficient fine-tuning (PEFT) has become increasingly important as foundation models continue to grow in both popularity and size. Adapter has been particularly well-received due to their potential for parameter reduction and adaptability across diverse tasks. However, striking a balance between high efficiency and robust generalization across tasks remains a challenge for adapter-based methods. We analyze existing methods and find that: 1) parameter sharing is the key to reducing redundancy; 2) more tunable parameters, dynamic allocation, and block-specific design are keys to improving performance. Unfortunately, no previous work considers all these factors. Inspired by this insight, we introduce a novel framework named Adapter-X. First, a Sharing Mixture of Adapters (SMoA) module is proposed to fulfill token-level dynamic allocation, increased tunable parameters, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
MethodsAdapter
