LADA: Scalable Label-Specific CLIP Adapter for Continual Learning

Mao-Lin Luo; Zi-Hao Zhou; Tong Wei; Min-Ling Zhang

arXiv:2505.23271·cs.CV·May 30, 2025

LADA: Scalable Label-Specific CLIP Adapter for Continual Learning

Mao-Lin Luo, Zi-Hao Zhou, Tong Wei, Min-Ling Zhang

PDF

Open Access 1 Repo

TL;DR

LADA introduces a scalable, label-specific adapter for CLIP that enhances continual learning by adding lightweight, task-specific memory units, preventing forgetting and improving discriminative feature generation without modifying the original encoder.

Contribution

LADA proposes a novel label-specific adapter that enables scalable continual learning with CLIP by appending memory units, avoiding parameter partitioning and reducing inference errors.

Findings

01

Achieves state-of-the-art continual learning performance

02

Effectively prevents catastrophic forgetting

03

Maintains efficient training with frozen CLIP encoder

Abstract

Continual learning with vision-language models like CLIP offers a pathway toward scalable machine learning systems by leveraging its transferable representations. Existing CLIP-based methods adapt the pre-trained image encoder by adding multiple sets of learnable parameters, with each task using a partial set of parameters. This requires selecting the expected parameters for input images during inference, which is prone to error that degrades performance. To address this problem, we introduce LADA (Label-specific ADApter). Instead of partitioning parameters across tasks, LADA appends lightweight, label-specific memory units to the frozen CLIP image encoder, enabling discriminative feature generation by aggregating task-agnostic knowledge. To prevent catastrophic forgetting, LADA employs feature distillation for seen classes, preventing their features from being interfered with by new…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maolinluo/lada
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Multimodal Machine Learning Applications

MethodsContrastive Language-Image Pre-training · Sparse Evolutionary Training