Memory-Efficient Continual Learning with CLIP Models

Ryan King; Gang Li; Bobak Mortazavi; Tianbao Yang

arXiv:2605.03866·cs.LG·May 6, 2026

Memory-Efficient Continual Learning with CLIP Models

Ryan King, Gang Li, Bobak Mortazavi, Tianbao Yang

PDF

TL;DR

This paper introduces a memory-efficient, distributionally robust method for continual learning with CLIP models, enabling quick adaptation and minimal forgetting even with limited memory buffers.

Contribution

It proposes a novel loss reweighting technique that improves CLIP's continual learning performance under small memory constraints.

Findings

01

Achieves effective class incremental learning on CIFAR-100 and ImageNet1K.

02

Maintains high performance with minimal memory buffer sizes.

03

Reduces catastrophic forgetting in domain incremental tasks.

Abstract

Contrastive Language-Image Pretraining (CLIP) models excel at understanding image-text relationships but struggle with adapting to new data without forgetting prior knowledge. To address this, models are typically fine-tuned using both new task data and a memory buffer of past tasks. However, CLIP's contrastive loss suffers when the memory buffer is small, leading to performance degradation on previous tasks. We propose a memory-efficient, distributionally robust method that dynamically reweights losses per class during training. Our approach, tested on class incremental settings (CIFAR-100, ImageNet1K) and a domain incremental setting (DomainNet) adapts CLIP models quickly while minimizing catastrophic forgetting, even with minimal memory usage.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.