Tunable MAGMAX: Preference-Aware Model Merging for Continual Learning
Kei Hiroshima, Kento Uchida, Shinichi Shirakawa

TL;DR
Tunable MAGMAX introduces a preference-aware model merging framework for continual learning, enabling dynamic adjustment of task performance based on deployment needs, with automatic preference vector construction.
Contribution
It proposes a novel preference vector mechanism and an automatic construction method for model merging in continual learning, addressing environment-specific performance requirements.
Findings
Effectively controls task-wise performance in CL models.
Automatically constructs preference vectors using minimal target environment data.
Achieves superior or comparable performance to baseline methods.
Abstract
Continual learning (CL) aims to train models sequentially on multiple tasks while mitigating catastrophic forgetting of previously learned knowledge. Recent advances in large pre-trained models (LPMs) and model merging techniques, such as MAGMAX, have demonstrated effective CL performance by combining task-specific parameters. However, existing methods primarily focus on average performance across all tasks and do not adequately address how to construct models accommodating different deployment environments or varying user preferences. This paper proposes a model merging framework, termed Tunable MAGMAX, which enables preference-aware control of task-specific performance in CL. Our method introduces a preference vector that controls the number of elements selected from each task vector during model merging, allowing us to adjust the merged model performance according to their deployment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
