Parameter Competition Balancing for Model Merging
Guodong Du, Junlin Lee, Jing Li, Runhua Jiang, Yifei Guo, Shuyang Yu,, Hanting Liu, Sim Kuan Goh, Ho-Kin Tang, Daojing He, Min Zhang

TL;DR
This paper presents PCB-Merging, a lightweight, training-free method for effectively merging multiple fine-tuned models by balancing parameter competition, leading to improved multitasking performance across diverse scenarios.
Contribution
Introduces PCB-Merging, a novel parameter balancing technique that adjusts parameter importance for better model merging without retraining.
Findings
Outperforms existing model merging methods across various tasks and domains.
Enhances out-of-domain generalization and multitasking capabilities.
Effective across different model sizes and modalities.
Abstract
While fine-tuning pretrained models has become common practice, these models often underperform outside their specific domains. Recently developed model merging techniques enable the direct integration of multiple models, each fine-tuned for distinct tasks, into a single model. This strategy promotes multitasking capabilities without requiring retraining on the original datasets. However, existing methods fall short in addressing potential conflicts and complex correlations between tasks, especially in parameter-level adjustments, posing a challenge in effectively balancing parameter competition across various tasks. This paper introduces an innovative technique named PCB-Merging (Parameter Competition Balancing), a lightweight and training-free technique that adjusts the coefficients of each parameter for effective model merging. PCB-Merging employs intra-balancing to gauge parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation · Simulation Techniques and Applications · Model-Driven Software Engineering Techniques
