Task Arithmetic in Trust Region: A Training-Free Model Merging Approach   to Navigate Knowledge Conflicts

Wenju Sun; Qingyong Li; Wen Wang; Yangli-ao Geng; Boyang Li

arXiv:2501.15065·cs.LG·January 28, 2025

Task Arithmetic in Trust Region: A Training-Free Model Merging Approach to Navigate Knowledge Conflicts

Wenju Sun, Qingyong Li, Wen Wang, Yangli-ao Geng, Boyang Li

PDF

Open Access

TL;DR

This paper introduces TATR, a training-free model merging method that uses trust regions to mitigate knowledge conflicts in task arithmetic, enhancing multi-task model performance without additional training.

Contribution

The paper proposes TATR, a novel trust region approach that addresses knowledge conflicts in task arithmetic, improving multi-task model merging efficiency and effectiveness.

Findings

01

TATR reduces performance degradation caused by task conflicts.

02

TATR improves multi-task merging performance across eight datasets.

03

TATR is compatible with various TA-based methods.

Abstract

Multi-task model merging offers an efficient solution for integrating knowledge from multiple fine-tuned models, mitigating the significant computational and storage demands associated with multi-task training. As a key technique in this field, Task Arithmetic (TA) defines task vectors by subtracting the pre-trained model ( $θ_{pre}$ ) from the fine-tuned task models in parameter space, then adjusting the weight between these task vectors and $θ_{pre}$ to balance task-generalized and task-specific knowledge. Despite the promising performance of TA, conflicts can arise among the task vectors, particularly when different tasks require distinct model adaptations. In this paper, we formally define this issue as knowledge conflicts, characterized by the performance degradation of one task after merging with a model fine-tuned for another task. Through in-depth analysis,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Auction Theory and Applications · Reinforcement Learning in Robotics

MethodsALIGN