Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters
Jiazuo Yu, Yunzhi Zhuge, Lu Zhang, Ping Hu, Dong Wang, Huchuan Lu and, You He

TL;DR
This paper introduces a parameter-efficient continual learning framework for vision-language models that dynamically expands with Mixture-of-Experts adapters and uses a novel auto-selector to maintain performance and reduce training costs.
Contribution
It proposes a novel dynamic expansion method with MoE adapters and a distribution auto-selector to improve continual learning in vision-language models, reducing parameter updates by 60%.
Findings
Outperforms previous state-of-the-art methods in continual learning tasks.
Reduces parameter training burden by 60%.
Maintains zero-shot recognition capabilities during incremental learning.
Abstract
Continual learning can empower vision-language models to continuously acquire new knowledge, without the need for access to the entire historical dataset. However, mitigating the performance degradation in large-scale models is non-trivial due to (i) parameter shifts throughout lifelong learning and (ii) significant computational burdens associated with full-model tuning. In this work, we present a parameter-efficient continual learning framework to alleviate long-term forgetting in incremental learning with vision-language models. Our approach involves the dynamic expansion of a pre-trained CLIP model, through the integration of Mixture-of-Experts (MoE) adapters in response to new tasks. To preserve the zero-shot recognition capability of vision-language models, we further introduce a Distribution Discriminative Auto-Selector (DDAS) that automatically routes in-distribution and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · COVID-19 diagnosis using AI
MethodsMixture of Experts · Adapter · Contrastive Language-Image Pre-training
