Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weighting

Hyeonseo Jang; Hyuk Kwon; and Kibok Lee

arXiv:2604.18075·cs.CV·April 21, 2026

Enhancing Continual Learning of Vision-Language Models via Dynamic Prefix Weighting

Hyeonseo Jang, Hyuk Kwon, and Kibok Lee

PDF

1 Repo

TL;DR

This paper introduces Dynamic Prefix Weighting (DPW), a novel method for improving continual learning in vision-language models by dynamically adjusting prefix weights and selectively utilizing adapters, leading to state-of-the-art results.

Contribution

The paper proposes DPW, a framework that dynamically weights prefixes and adapters based on input importance, enhancing continual learning in vision-language models.

Findings

01

Achieves state-of-the-art performance in domain-class incremental learning for VLMs.

02

Demonstrates effective dynamic adjustment of prefix and adapter weights.

03

Improves model adaptation to downstream tasks.

Abstract

We investigate recently introduced domain-class incremental learning scenarios for vision-language models (VLMs). Recent works address this challenge using parameter-efficient methods, such as prefix-tuning or adapters, which facilitate model adaptation to downstream tasks by incorporating task-specific information into input tokens through additive vectors. However, previous approaches often normalize the weights of these vectors, disregarding the fact that different input tokens require different degrees of adjustment. To overcome this issue, we propose Dynamic Prefix Weighting (DPW), a framework that dynamically assigns weights to prefixes, complemented by adapters. DPW consists of 1) a gating module that adjusts the weights of each prefix based on the importance of the corresponding input token, and 2) a weighting mechanism that derives adapter output weights as a residual of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YonseiML/dpw
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.