CAT Merging: A Training-Free Approach for Resolving Conflicts in Model Merging
Wenju Sun, Qingyong Li, Yangli-ao Geng, Boyang Li

TL;DR
CAT Merging is a training-free method that improves multi-task model merging by selectively removing conflicting components from task vectors, leading to better performance across vision, language, and vision-language tasks.
Contribution
It introduces a novel conflict-aware framework that addresses knowledge conflicts in model merging without additional training, using parameter-specific strategies.
Findings
Achieves up to 2.5% accuracy improvement on vision tasks.
Effectively suppresses knowledge conflicts in multi-task merging.
Demonstrates versatility across vision, language, and vision-language tasks.
Abstract
Multi-task model merging offers a promising paradigm for integrating multiple expert models into a unified model without additional training. Existing state-of-the-art techniques, such as Task Arithmetic and its variants, merge models by accumulating task vectors -- the parameter differences between pretrained and finetuned models. However, task vector accumulation is often hindered by knowledge conflicts, leading to performance degradation. To address this challenge, we propose Conflict-Aware Task Merging (CAT Merging), a novel training-free framework that selectively trims conflict-prone components from the task vectors. CAT Merging introduces several parameter-specific strategies, including projection for linear weights and masking for scaling and shifting parameters in normalization layers. Extensive experiments on vision, language, and vision-language tasks demonstrate that CAT…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI) · Topic Modeling
