RegCL: Continual Adaptation of Segment Anything Model via Model Merging
Yuan-Chen Shu, Zhiwei Lin, Yongtao Wang

TL;DR
RegCL introduces a model merging framework for continual learning that consolidates multi-domain knowledge in the Segment Anything Model without increasing model size or requiring historical data, addressing catastrophic forgetting.
Contribution
It proposes a novel non-replay continual learning method using model merging to efficiently adapt SAM across multiple domains.
Findings
Achieves effective multi-domain knowledge integration.
Maintains constant model size regardless of tasks.
Outperforms existing methods in continual learning scenarios.
Abstract
To address the performance limitations of the Segment Anything Model (SAM) in specific domains, existing works primarily adopt adapter-based one-step adaptation paradigms. However, some of these methods are specific developed for specific domains. If used on other domains may lead to performance degradation. This issue of catastrophic forgetting severely limits the model's scalability. To address this issue, this paper proposes RegCL, a novel non-replay continual learning (CL) framework designed for efficient multi-domain knowledge integration through model merging. Specifically, RegCL incorporates the model merging algorithm into the continual learning paradigm by merging the parameters of SAM's adaptation modules (e.g., LoRA modules) trained on different domains. The merging process is guided by weight optimization, which minimizes prediction discrepancies between the merged model and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsService-Oriented Architecture and Web Services · Business Process Modeling and Analysis · Advanced Database Systems and Queries
MethodsADaptive gradient method with the OPTimal convergence rate
