MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning
Hai-Long Sun, Da-Wei Zhou, Hanbin Zhao, Le Gan, De-Chuan Zhan, Han-Jia Ye

TL;DR
This paper introduces MOS, a model surgery approach that uses task-specific adapters and a retrieval mechanism to prevent catastrophic forgetting in pre-trained models during class-incremental learning, achieving state-of-the-art results.
Contribution
MOS combines adapter merging and a self-refined retrieval mechanism to effectively mitigate both parameter and retrieval-level forgetting in PTMs for CIL.
Findings
MOS outperforms existing methods on seven benchmarks.
The adapter merging approach preserves task-specific information.
The self-refined retrieval improves adapter selection during inference.
Abstract
Class-Incremental Learning (CIL) requires models to continually acquire knowledge of new classes without forgetting old ones. Despite Pre-trained Models (PTMs) have shown excellent performance in CIL, catastrophic forgetting still occurs as the model learns new concepts. Existing work seeks to utilize lightweight components to adjust the PTM, while the forgetting phenomenon still comes from {\em parameter and retrieval} levels. Specifically, iterative updates of the model result in parameter drift, while mistakenly retrieving irrelevant modules leads to the mismatch during inference. To this end, we propose MOdel Surgery (MOS) to rescue the model from forgetting previous knowledge. By training task-specific adapters, we continually adjust the PTM to downstream tasks. To mitigate parameter-level forgetting, we present an adapter merging approach to learn task-specific adapters, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsEducational Assessment and Pedagogy
MethodsAdapter
