MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning

Hai-Long Sun; Da-Wei Zhou; Hanbin Zhao; Le Gan; De-Chuan Zhan; Han-Jia Ye

arXiv:2412.09441·cs.LG·June 19, 2025

MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning

Hai-Long Sun, Da-Wei Zhou, Hanbin Zhao, Le Gan, De-Chuan Zhan, Han-Jia Ye

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces MOS, a model surgery approach that uses task-specific adapters and a retrieval mechanism to prevent catastrophic forgetting in pre-trained models during class-incremental learning, achieving state-of-the-art results.

Contribution

MOS combines adapter merging and a self-refined retrieval mechanism to effectively mitigate both parameter and retrieval-level forgetting in PTMs for CIL.

Findings

01

MOS outperforms existing methods on seven benchmarks.

02

The adapter merging approach preserves task-specific information.

03

The self-refined retrieval improves adapter selection during inference.

Abstract

Class-Incremental Learning (CIL) requires models to continually acquire knowledge of new classes without forgetting old ones. Despite Pre-trained Models (PTMs) have shown excellent performance in CIL, catastrophic forgetting still occurs as the model learns new concepts. Existing work seeks to utilize lightweight components to adjust the PTM, while the forgetting phenomenon still comes from {\em parameter and retrieval} levels. Specifically, iterative updates of the model result in parameter drift, while mistakenly retrieving irrelevant modules leads to the mismatch during inference. To this end, we propose MOdel Surgery (MOS) to rescue the model from forgetting previous knowledge. By training task-specific adapters, we continually adjust the PTM to downstream tasks. To mitigate parameter-level forgetting, we present an adapter merging approach to learn task-specific adapters, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning· underline

Taxonomy

TopicsEducational Assessment and Pedagogy

MethodsAdapter