A Simple Efficiency Incremental Learning Framework via Vision-Language Model with Nonlinear Multi-Adapters

Haihua Luo; Xuming Ran; Jiangrong Shen; Timo H\"am\"al\"ainen; Zhonghua Chen; Qi Xu; Fengyu Cong

arXiv:2603.11211·cs.CV·March 20, 2026

A Simple Efficiency Incremental Learning Framework via Vision-Language Model with Nonlinear Multi-Adapters

Haihua Luo, Xuming Ran, Jiangrong Shen, Timo H\"am\"al\"ainen, Zhonghua Chen, Qi Xu, Fengyu Cong

PDF

Open Access

TL;DR

This paper introduces SimE, a simple and efficient incremental learning framework using vision-language models with nonlinear adapter connections, improving training efficiency and performance without relying on large memory banks or strong backbones.

Contribution

Proposes SimE, a novel incremental learning framework with nonlinear adapter connections, demonstrating improved efficiency and performance over traditional methods and CLIP-based approaches.

Findings

01

SimE surpasses traditional methods by 9.6% on TinyImageNet.

02

SimE outperforms other CLIP-based methods by 5.3% on CIFAR-100.

03

Nonlinear correlation exists between adapter connections and IL capabilities.

Abstract

Incremental Learning (IL) aims to learn new tasks while preserving previously acquired knowledge. Integrating the zero-shot learning capabilities of pre-trained vision-language models into IL methods has marked a significant advancement. However, these methods face three primary challenges: (1) the need for improved training efficiency; (2) reliance on a memory bank to store previous data; and (3) the necessity of a strong backbone to augment the model's capabilities. In this paper, we propose SimE, a Simple and Efficient framework that employs a vision-language model with adapters designed specifically for the IL task. We report a remarkable phenomenon: there is a nonlinear correlation between the number of adaptive adapter connections and the model's IL capabilities. While increasing adapter connections between transformer blocks improves model performance, adding more adaptive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications