Task Arithmetic in Trust Region: A Training-Free Model Merging Approach to Navigate Knowledge Conflicts
Wenju Sun, Qingyong Li, Wen Wang, Yangli-ao Geng, Boyang Li

TL;DR
This paper introduces TATR, a training-free model merging method that uses trust regions to mitigate knowledge conflicts in task arithmetic, enhancing multi-task model performance without additional training.
Contribution
The paper proposes TATR, a novel trust region approach that addresses knowledge conflicts in task arithmetic, improving multi-task model merging efficiency and effectiveness.
Findings
TATR reduces performance degradation caused by task conflicts.
TATR improves multi-task merging performance across eight datasets.
TATR is compatible with various TA-based methods.
Abstract
Multi-task model merging offers an efficient solution for integrating knowledge from multiple fine-tuned models, mitigating the significant computational and storage demands associated with multi-task training. As a key technique in this field, Task Arithmetic (TA) defines task vectors by subtracting the pre-trained model () from the fine-tuned task models in parameter space, then adjusting the weight between these task vectors and to balance task-generalized and task-specific knowledge. Despite the promising performance of TA, conflicts can arise among the task vectors, particularly when different tasks require distinct model adaptations. In this paper, we formally define this issue as knowledge conflicts, characterized by the performance degradation of one task after merging with a model fine-tuned for another task. Through in-depth analysis,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Auction Theory and Applications · Reinforcement Learning in Robotics
MethodsALIGN
