Tunable MAGMAX: Preference-Aware Model Merging for Continual Learning

Kei Hiroshima; Kento Uchida; Shinichi Shirakawa

arXiv:2605.20803·cs.LG·May 21, 2026

Tunable MAGMAX: Preference-Aware Model Merging for Continual Learning

Kei Hiroshima, Kento Uchida, Shinichi Shirakawa

PDF

TL;DR

Tunable MAGMAX introduces a preference-aware model merging framework for continual learning, enabling dynamic adjustment of task performance based on deployment needs, with automatic preference vector construction.

Contribution

It proposes a novel preference vector mechanism and an automatic construction method for model merging in continual learning, addressing environment-specific performance requirements.

Findings

01

Effectively controls task-wise performance in CL models.

02

Automatically constructs preference vectors using minimal target environment data.

03

Achieves superior or comparable performance to baseline methods.

Abstract

Continual learning (CL) aims to train models sequentially on multiple tasks while mitigating catastrophic forgetting of previously learned knowledge. Recent advances in large pre-trained models (LPMs) and model merging techniques, such as MAGMAX, have demonstrated effective CL performance by combining task-specific parameters. However, existing methods primarily focus on average performance across all tasks and do not adequately address how to construct models accommodating different deployment environments or varying user preferences. This paper proposes a model merging framework, termed Tunable MAGMAX, which enables preference-aware control of task-specific performance in CL. Our method introduces a preference vector that controls the number of elements selected from each task vector during model merging, allowing us to adjust the merged model performance according to their deployment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.